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Preface 



The subject of this book is the investigation of tree transducers. Tree trans- 
ducers were introduced in theoretical computer science in order to study 
the general properties of formal models which give semantics to context-free 
languages in a syntax-directed way. Such formal models include attribute 
grammars with synthesized attributes only, denotational semantics, and at- 
tribute grammars (with synthesized and inherited attributes). However, these 
formal models share certain constituents which are irrelevant in the investi- 
gation of the general properties considered in this book. In particular, we 
can abstract (a) from derivation trees of the context-free grammar and take 
trees over some ranked alphabet, (b) from the semantic domain of the model 
and use the initial term algebra instead, and finally (c) from the machine- 
oriented computation paradigm, which maintains the incarnation information 
of recursive function calls, and take a term rewriting semantics instead. Ap- 
plying these three abstraction steps to attribute grammars with synthesized 
attributes only, to denotational semantics, and to attribute grammars we 
obtain the concepts of top-down tree transducer^ macro tree transducer^ and 
attributed tree transducer^ respectively. The macro attributed tree transducer 
combines the concepts of the macro tree transducer and the attributed tree 
transducer. This book explores the general properties of these four types of 
tree transducers. 

The general properties which we examine are the transformational power 
of the tree transducers above, the restriction of their resources, and an inten- 
sive investigation into the composition and decomposition of the tree trans- 
formations induced by them. Our aim is to develop a theory of the these types 
of tree transducers and thereby contribute to theoretical computer science. 

We intended this book for those who work in an academic environment, 
e.g., in universities or research institutes, and who deal with theoretical com- 
puter science. It is a small handbook of recent results on tree transducers 
that can be used both for carrying out further research and giving lectures in 
the topic. The subject can also be taught to graduate students in computer 
science. However, as prerequisites, some basic knowledge of universal algebra 
and of automata and formal language theory is necessary to use the book 
effectively. 




VIII Preface 



The background to the book is as follows. We began to carry out research 
into the theory of tree transducers independently from each other many years 
ago. Since meeting for the first time in 1986 in Szeged, Hungary, we have con- 
tinued a constant discussion about the state of the art. We have attempted 
to systematize the results from different research schools using a unified ter- 
minology. Some years ago we decided to write down our knowledge in the 
form of a book and the result is in your hands. 

The authors are grateful to many colleagues who in one way or another 
contributed to this book. The first author was introduced to the subject 
by Ferenc Gecseg and benefited much from valuable discussions with Zoltan 
Esik, Sandor Vagvolgyi, and Magnus Steinby. The second author had the 
luck to meet Joost and Louco Engelfriet and, also, to learn the theory of 
tree transducers from Joost. He had fruitful discussions about the subject 
with Armin Kiihnemann and Thomas Noll. Both authors are very grateful 
to Armin Kiihnemann for careful reading of the manuscript. They would 
also like to thank Sven Buchholz, Matthias Gette, and Christian Lescher for 
taking over the troublesome task of drawing the figures. Many thanks are also 
due to the computer system managers Karoly Devenyi and Sebastian Maneth 
who maintained the intensive computer connection between our universities 
on which our joint progress was based. 

Last but not least, the authors are indebted to Springer-Verlag, especially 
to Dr. Hans Wossner and his team, for the pleasant and highly professional 
cooperation during the preparation of the camera ready version of this mono- 
graph. 

Our work was supported by different grants. We received much assis- 
tance from the German-Hungarian Research Project “Formal Languages, 
Automata and Petri-Nets” (1995-1997) supported by the Te^T Foundation^ 
Budapest, Hungary, grant no. D/102 (formerly: No. OMFB-NPI-102) and by 
the Forschungszentrum Karlsruhe, Germany, grant no. 233.6, respectively. 
We were also supported by the Hungarian Scientific Foundation (OTKA), 
grant no. T 015843 and by the Ministry of Culture and Education of Hun- 
gary, grant no. FKFP 0095/1997. 
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1. Introduction 



1.1 Syntax-Directed Semantics 

In computer science one frequently encounters the task of specifying the se- 
mantics of a context-free language. In detail, this means that there is a lan- 
guage L generated by a context-free grammar and every string w E L has 
a meaning, called the semantics of w] the semantics is an element of the 
carrier set of some semantic domain. Let us call the whole machinery which 
computes the semantics of w the semantics evaluator (Fig. 1.1). A typical oc- 
currence is where L is a programming language, i.e, the strings are programs, 
and the semantics of a program w is the assembly code which is obtained by 
translating w into some assembly language. 



semantics evaluator 



strmg w_^ string 
of L 



value! 



semantics of w 
as value of some 
carrier set 



Fig. 1.1. The semantics evaluator 



The abstract method 

A well-known abstract method of specifying the semantics of a context-free 
language is the so-called syntax-directed semantics. 

Syntax- directed semantics is based on the idea that the semantics of a 
string of a context-free language is defined in terms of the semantics 
of the syntactic substructures of the string. 
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Thus, if the semantics is specified in a syntax-directed way, it is neces- 
sary that the string w E L is already parsed before the computation of its 
semantic value can begin. Hence the semantics evaluator is subdivided into 
a parser which constructs the derivation tree ty, of and a syntax- directed 
semantics evaluator which computes the value of tyj (Fig. 1.2; in figures we 
often abbreviate the term “syntax-directed semantics” to “sds”). 





parser 


sds evaluator 




— string 


derivation 

tree 


value 















Fig. 1.2. The sds evaluator 



In this book we are not interested in parsing. Instead we focus on formal 
models which obey the abstract specification method of syntax-directed se- 
mantics. In Fig. 1.2 we indicate this by drawing a box with bold lines around 
the syntax-directed semantics evaluator. Also in the other relevant figures in 
Sections 1.1 and 1.2 we indicate our focus by bold lines. 

In order to compute the syntax-directed semantics of a derivation tree ty, 
we need two modules (Fig. 1.3): 

• a specification S of the syntax-directed semantics in some specification 
language and 

• a computation paradigm which defines how to compute the semantic value 
of a derivation tree tyj according to S. 

A combination of both, i.e., a specification language which is based on the 
concept of syntax-directed semantics, and an appropriate computation para- 
digm, is called a formal model of syntax- directed semantics. We will develop 
three different such formal models. More precisely, the first formal model is 
based on the abstract method as defined above, whereas the second and third 
formal models are based on a more flexible, extended abstract method which 
will be defined later. 

In the following we will assume that the context-free language L is gen- 
erated by the context-free grammar G = (iV, T, P, 5). 
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Fig. 1.3. The sds evaluator based on two modules 



A first formal model; attribute grammars 
with synthesized attributes only 

The first formal model of syntax-directed semantics which we are going to 
examine, is called attribute grammars with synthesized attributes only. 

First we must realize that a string w E L may have several semantics, 
e.g., the length of w or the number of occurrences of a particular symbol in 
w. Hence, we need a finite set M the elements of which are called “meaning 
names” (e.g., length and number-of-a for some symbol a can be such mean- 
ing names). Roughly speaking, every meaning name represents a semantics 
of w. More precisely, for every meaning name m £ M^vfe specify a set Vm of 
values, called the carrier set. The m-meaning of w is then an element of Vm- 
Moreover, we distinguish a “principal” meaning name mo which represents 
the meaning of w in which we are interested. The other meaning names may 
be considered as auxiliary ones. 

To every nonterminal A E N of the context-free grammar, some of the 
meaning names are associated by means of the function a : N ^ 'P(M), 
where V{M) denotes the power set of M. Moreover, to every production 

p:A-^ WoAiWi . ..Wn--lAnWny 

(with nonterminals A,Ai,...,An) and every meaning name m G a(A), a 
semantic equation of the form 



m{A) = < 



is associated which is called the {m^ A) -equation ofp. Its right-hand side ^ is 
an expression built up from basic functions over carrier sets and compound 
objects of the form mj{Ai.) with ruj G and 1 < ij < n. Intuitively, 

the (m, A)-equation of p expresses the m-meaning of a string w which has 
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been derived from A by starting with production p. If mi(A,j), . . . , mr(Ai^) 
are the compound objects which occur in C, then for every 1 < j < r, the 
m-meaning of w depends on the mj -meaning of the substring of w derived 
from Aij, Hence, the computation of the m-meaning of A assumes that, for 
every 1 < j < r, the mj-meaning of Aij is known. 

Let us denote the system of the productions of G together with the corre- 
sponding sets of associated semantic equations by Ssem- This system clearly 
forms a specification of the semantics of L(G), i.e., the language generated 
by G, in a syntax-directed way. Since Ssem itself is not executable, we first 
transcribe it into a system Sjp of recursive function procedures (Fig. 1.4) and 
then use the usual, well-known computation paradigm for recursive function 
procedures. More precisely, for every meaning name m, we construct a recur- 



specification 
by ^sem 



transcription 



system 

Sjp 



parser 



sds evaluator 



string 



derivation 

tree 




value 



Fig. 1.4. Specification by Saem and its transcription into Sjp 



sive function procedure m-eval which has one formal parameter d\ the actual 
value of this parameter is a node of some derivation tree i. The body of m- 
eval just consists of one case statement. For every production p for which 
there is an (m, .A)-equation of p, this case statement contains one clause. 
More precisely, if m{A) = C is A)-equation of p, then 

p : m-eval := 

is a clause in the case statement and C is obtained from C by replacing 
every compound object rrij{Ai-) by the function call mj-eval{dij) where dij 
denotes the ij -th descendant of d. We formulate the function procedure in an 
imperative programming language style as shown in Fig. 1.5. 








1.1 Syntax-Directed Semantics 



5 



func m-eval{d: node): Kn ; 

begin 

case production applied at d of 
p: m-eval := 

end 

end m-eval 



Fig. 1.5. The function procedure m-eval 



As already mentioned, the semantics of w is obtained in the usual way by 
calling the function procedure mo-eval at the root of the derivation tree 
of w. In fact, the underlying computation paradigm is defined by a machine 
and a compiler; we will discuss this machine-oriented computation paradigm 
in a later subsection. 

We note that in the area of attribute grammars the meaning names are 
called synthesized attributes. 

An example of an attribute grammar 
with synthesized attributes only 

Let us illustrate this formal model by discussing an example in detail. Let Go 
be the usual context-free grammar which generates arithmetic expressions 
built up from the variables a, 6, and c, the operation symbols , and ♦, 
and the two parentheses ( and ). Hence, Go uses the nonterminals 

E (start symbol) for generating expressions, 

T for generating tags of expressions, 

F for generating factors, and 

I for generating identifiers 



it contains the productions 




Pi : E — ► E -{-T 


P2 • E — >■ E — T 


pz '■ E —* T 


P4 : T —^T^F 


P^-.T^F 




P6-.F-^ (E) 


PT.F^ I 




P& : I ^ a 


P9 -. I ^ b 


pio : / ^ c. 



We note that Go can easily be extended such that it generates infinitely many 
identifiers. However, three identifiers are sufficient for illustrating this formal 
model. 

It should be clear that a natural semantics can be associated with every 
expression e. Let us call this semantics the expression semantics of e and 
denote it by £^|e]. It is a function such that, for every ni, ri 2 , ns G N, where N 
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is the set of natural numbers, 5|e](ni, ri 2 , ns) is the value of e if we replace a, 
6, and c by ni, n 2 , and ri 3 , respectively. Thus, e.g., £^|[a*(6 + a)|(3, 2, 5) = 15. 

We now specify another semantics of an expression e E i^(Go) which we 
call the code semantics of e. It is an assembly program ass-prog{e) such 
that, if asS‘prog{e) is executed on a machine A1, the machine computes the 
expression semantics £lej of e. In other words, we describe a small compiler 
for arithmetic expressions into assembly code which should be correct with 
respect to the expression semantics. Hence, the expression semantics is our 
reference semantics for the correctness of the compiler. 

The machine M has an accumulator and a memory which consists of 
memory locations (Fig. 1.6). The accumulator and every memory location 
can store a value from N. There are three memory locations referenced by 
the addresses a, 6, and c, respectively. At the beginning of the computation, 
these locations are supposed to keep the values of the identifiers a, 6, and c, 
respectively. Moreover, for every i G N, there is a memory location referenced 
by the address i. For every address i 6 {a, 6, c}UN, let c{i) denote the contents 
of the location with address i. The machine accepts the following assembly 
instructions: 




Fig. 1.6. The configuration of the machine M 



LOAD i 
STORE i 
ADDi 
SUB i 
MUL i 



for loading c{i) into the accumulator, 

for storing the value of the accumulator in location z, 

for adding c{i) to the value of the accumulator, 

for subtracting c{i) from the value of the accumulator, 

for multiplying c{i) with the value of the accumulator. 



If q is an assembly program and ni,n 2 ,ri 3 G N, then we denote by 
the value of the accumulator after having executed q on 
M with ni,n 2 ,ri 3 as initial values of memory locations a, 6, and c, respec- 
tively. 
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The code semantics of an arithmetic expression e is now specified in a 
syntax-directed way. For this purpose we will use three meaning names, viz., 
code, temp, and name, where code is the principal one, i.e., for every arith- 
metic expression e, we define ass-prog{e) to be the value of code{E) at the 
root of the derivation tree of e. The meaning names are associated with non- 
terminals as follows 

a{E) = a{T) = a(F) = {code, temp}, 
a{I) = {temp, name}. 

The carrier sets of code, temp and name are the sets 

Vcode = U is a sequence of assembly instructions}, 

^temp = N , 

Kiamc — 

The semantic equations for code, temp and name are designed as follows. Let 
t be a derivation tree and d be a node of t labeled by E, F, or T. Moreover, 
let s be the subtree of t starting at node d and let v be the frontier of s (Fig. 
1.7 where X e{E,F,T}). 




Fig. 1.7. A derivation tree t and the subtree s at node d 



• The value of the meaning name code at node d (i.e., code(X)) is the as- 
sembly program q such that Ad[gJ = Slv} and 

• the value of temp at d (i.e., temp{X)) is the height of s, i.e., the maximum 
of the lengths of all paths of s. 

If d' is a node of t labeled by I, then 

• the value of name at d' is the identifier which labels the immediate descen- 
dant of d' and 
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• the value of temp at d' is 1, because the height of the subtree rooted by d' 
is 1. 

The meaning name temp is needed to compute the value of code at node d, 
in particular, if the production *‘£'4- T, — T, or T—^T*F is 

applied at d. Since the machine has just one accumulator, the value of the first 
argument of the binary operation (+, ~, or *) has to be stored temporarily 
in a safe location in the memory before computing the value of the second 
argument. A location is safe if it is not overwritten by the computation of the 
second argument. We specify the value of temp at a node d to be the height 
of the subtree starting at node d. One can easily check that this does provide 
a safe location. 

First we consider the production p\: Eq Ei-\-T and associate semantic 
equations with it. We use the indices 0 and 1 in order to distinguish the 
two occurrences of E in the semantic equations. With pi the following two 
semantic equations are associated: 

Pi : Eo El T code(Eo) = code(Ei) STORE temp(T) 

code{T) ADD temp{T) 

temp{Eo) = max{temp{Ei)^temp{T)) 1 

Thus, for every expression eo of the form ei + ^ (where ei and t are derived 
from El and T, respectively), the assembly code code(eo) consists of the 
following pieces: 

1. the value of code(£i), which is the code of the subexpression ei (the 
execution of code(Ei) computes the expression semantics of ei in the 
accumulator), 

2. the STORE temp{T) instruction (its execution stores the value of the 
accumulator to the safe memory location with address temp(T)), 

3. the value of code{T), which is the code of the subexpression t (its execu- 
tion computes the expression semantics of t in the accumulator) , and 

4. the ADD temp{T) instruction (its execution retrieves the value from the 
memory location with address temp{T) and adds it to the contents of 
the accumulator). 

We note the semantic equation for code{Eo) fits into the general form of se- 
mantic equations by letting C be the expression f{code{Ei),temp(T), code{T)) 
where / is the function 

/ • Vcode ^ Vtemp ^ Vcode ^ Vcode 

such that, for any codes codeia code 2 E Vcode and address i G Vtempj we define 
/(codei, 2 ,code 2 ) = codei STORE i code 2 ADD i 
The semantic equations of all the productions are shown in Fig. 1.8. 
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Pi : Eq El T code{Eo) 


= 


code(Ei) STORE temp(T) code(T) 
ADD temp{T) 


temp{Eo) 




max{temp{Ei),temp{T)) + 1 


P 2 * Eq — > El — T code{Eo) 




code{T) STORE temp{Ei) code{Ei) 
SUB temp{Ei) 


temp(Eo) 


= 


max{temp{Ei),temp{T)) + 1 


Ps- E T code{E) 




code{T) 


temp{E) 


— 


temp{T) + 1 


P 4 i Tq ^ Ti * F code(To) 


= 


code{Ti) STORE temp{F) code{F) 
MUL temp{F) 


temp{To) 


= 


max{temp{Ti),temp{F)) + 1 


P5 :T F, pq : F (E) 


The semantic equations are 


the 


same as the semantic equations 


for ps except that E and T 


are 


replaced appropriately. 


P 7 : F ^ I code{F) 


= 


LOAD name{I) 


iemp{F) 


= 


temp{I) -h 1 


Ps : I a name{I) 


= 


a 


temp{I) 


= 


1 


P 9 : 1 b, pio : I ^ c 


The semantic equations are 


obtained from the equations of ps by 


replacing a by 6 and c, respectively. 



Fig. 1.8. The semantic equations for code, temp, and name in the framework of 
attribute grammars with synthesized attributes only 



As an example for the transcription of semantic equations to recursive 
function procedures, we show the recursive function procedure code-eval (Fig. 
1.9). There are seven productions p such that there is a (code, A)-equation 
of p for some nonterminal X: pi through p?. Hence, the case statement of 
code-eval consists of seven clauses. Note that, e.g., temp-eval{d2) is the call 
of function procedure temp-eval to the second descendant of d. 

We now give an example for computing the code semantics of a particular 
arithmetic expression generated by Go- Let us consider e = a* (b a) and 
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func code-eval {d: node): Vcode] 

begin 

case production applied at d of 

Pi: code-eval := code-eval{dl) STORE temp-eval{d2) 
code-eval{d2) ADD temp-eval{d2); 
P2- code-eval := code-eval{d2) STORE temp-eval{dl) 
code-eval{dl) SUB temp-eval{dl); 
ps: code-eval := code-eval{dl)] 

P 4 i code-eval := code-eval{dl) STORE temp-eval{d2) 
code-eval{d2) MUL temp-eval{d2)] 
Ps: code-eval := code-eval{dl); 

Pq: code-eval := code-eval{dl); 
pr: code-eval := LOAD name-eval{dl) 
end 

end code-eval 



Fig. 1.9. The function procedure code-eval 



assume that the derivation tree te of e is given (Fig. 1.10). We can then call 
the recursive function procedure code-eval at the root of fg- In Fig. 1.10, we 
have attached to some nodes the values of the meaning names as they are 
computed by the corresponding function calls. 

Thus, for the string e = a * (6 + a), we have obtained that code(E) at the 
root of te is the following assembly program (clearly, this is not optimal); it 
is the code semantics ass-prog{e) of e: 

LOAD a 
STORE 5 
LOAD b 
STORE 2 
LOAD a 
ADD 2 
MUL 5 

The extended abstract method 

The foregoing specification of the code semantics of arithmetic expressions is a 
typical syntax-directed one. Often however, meaning names are not sufficient 
to specify the semantics of more complex context-free languages, e.g., high- 
level programming languages. This deficiency becomes obvious if we consider 
the while statement and its translation into assembly code. Let the syntax of 
the while statement be given by the context-free production 

W — ► while £■ > 0 do L od 






1.1 Syntax- Directed Semantics 



11 



temp 

code 



temp = 6 
code = LOAD a 
STORE 5 
LOAD b 
STORE 2 

/ LOAD G 
ADD 2 
MUL 5 




a 



T temp = 1 ' ► F 

I code = LOAD a I 




/ 



Fig. 1.10. The derivation tree te of the string a*{h-\-a) with the computed values 
of code and temp at some nodes 
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where L generates any finite list of statements. Assume that the semantics of a 
string (i.e., program) of the language is an assembly program for the machine 
Ad as in the previous example. It should be clear that for such assembly 
programs a sequential control structure is not sufficient any more. Hence, we 
assume that M additionally accepts the following jump instructions: 

JUMP i for an unconditional jump to the i-th statement of the 
assembly program and 

JNP z for a jump to the z-th statement of the assembly program 
provided that the value in the accumulator is not positive, 
otherwise the next assembly instruction is executed. 

Then, as usual, for every while statement w of the form while e > 0 do 
/ od (where e and I are an arithmetic expression and a list of statements, 
respectively), the assembly program ass-prog{w) of w is composed of the 
following pieces: 

1. the code for computing the expression semantics of e in the accumulator, 

2. the instruction JNP m, where m is the address of the next instruction be- 
hind ass-prog{w) (if the value of the accumulator is not positive, then its 
execution transfers the control of M to the next instruction behind ass- 
prog(w)j otherwise the assembly instruction behind JNP is executed), 

3. the code which results from the translation of /, and 

4. the instruction JUMP n, where n is the address of the first instruction 
of ass-prog(w) (its execution transfers the control of Ad to the code 
for evaluating e again, because its value may have changed during the 
execution of /). 

In particular in the fourth piece of ass-prog{w)^ we should know the address 
n of the first instruction of ass-prog(w). One can easily see that this address 
cannot be computed by using meaning names of E and L, because meaning 
names at a node can keep information only about the descendants of that 
node. So we should extend the original abstract method in such a way that a 
node d of the derivation tree t can also get information from its surrounding 
context, i.e., from that part of i which is outside the subtree rooted by d. 
Hence we refine our original abstract method as follows: 

Syntax- directed semantics with context handling is based on the idea 
that, for a word u; of a context-free language, the semantics of a 
substring v of ti; is defined in terms of the semantics of the syntactic 
substructures of v and the context of v with respect to w. 

Now there are two possible ways to formalize this extended abstract 
method: 

• the implicit handling of context information and 

• the explicit handling of context information. 
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The first way leads to formal models which are close to denotational seman- 
tics, and the second way leads to formal models which are close to attribute 
grammars. In the rest of this section we will develop instances of these two 
types of formal models. Since this development is based on the notions and 
notations of the original abstract method, we note that the first type of for- 
mal model will appear slightly different to the style in which denotational 
semantics usually appears in the literature. We will discuss this difference 
later. 

A second formal model: denotational semantics 

In this second formal model the pieces of information about the surrounding 
of some node d of a derivation tree t, i.e., the context information of d, do not 
have explicit names. Instead, meaning names are allowed to have parameters 
and the pieces of context information are passed via such parameters. In 
this sense context information is handled in an implicit way. Certainly, the 
context information coming via d from its context can also be passed down 
to the descendants of d as parameters of other meaning names. 

As in our first formal model, the set of meaning names is denoted by M. 
However, for every meaning name m G M, the carrier set Kn of m is no 
longer a set of elementary objects, but a set of functions over other carrier 
sets (we do not consider recursive domain equations here). Thus, m is a 
function of the type Vi x . . . x V* F. As usual the mapping a : N ^ V{M) 
associates meaning names to nonterminals. To every production p of G a set 
of semantic equations is associated such that for every meaning name of the 
nonterminal on the left-hand side of p there is exactly one semantic equation 
which specifies the semantics. More precisely, let 

p : WqAiWi . . .Wn^lAnWn 

be a production of G and m G M be a meaning name. Then the m-meaning of 
every string which is derived from A by starting with production p is defined 
by a semantic equation of the form 



m{A){yi,...,yk) = C, 

where yu . • are the parameters of m, and C is an expression built up from 
the parameters basic functions over carrier sets, and compound 

objects of the form mj(Aij) where rrij G <^{Aij) and I < ij < n. Clearly, 
since meaning names have parameters, they may also occur nested in The 
left-hand side of the equation should be parsed as follows: m{A) is a function 
with k arguments, and it is applied to pi , . . . , t/jb. We also refer to this equation 
here as the (m, A)-equation. 

Intuitively, the (m, A)-equation of p defines the value of m at every node d 
at which the production p is applied. If the values of the parameters Pi , . . . , t/ib 
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are available from the surrounding of d, then the value of m is computed 
by evaluating C- This may naturally invoke the calculation of other mean- 
ing names on descendant nodes of d. To enable this computation, for every 
meaning name m, we again construct a recursive function procedure m-eval. 
It has one formal parameter of type node; additionally it has formal parame- 
ters 2 / 1 -par, . . . , yk-po>r of type Vi, . . . , Vi, respectively, which represent the 
parameters of the meaning name m. Assume that Vm is the set of functions 
of the type V\ x . . . x Vk V . Then the function procedure m-eval is shown 

in Fig. 1.11 where C is obtained from C by replacing 

• every subexpression my )(...) by my-eva/(diy , . . .), where dij is again 
the iy-th descendant of d, and 

• every yj by yj-par. 



func m-eval{d: node, yi-par: Vi, . . .^yk-par: Vi): V ; 
begin 

case production applied at d of 
p : m-eval := 

end 

end m-eval 



Fig. 1.11. The function procedure m-eval with formal parameters 



We have called this formal model denotational semantics. However, we 
should note that denotational semantics allows for much more flexible and 
powerful constructions on the right-hand sides of semantic equations (as, 
e.g., A-abstractions). In denotational semantics the meaning names are called 
semantic functions. As already mentioned, the usual notation slightly differs 
from the notation developed here. In particular, for a production 

p : WoAiWi . . .Wn^lAnWn 

and a semantic function m of A, the corresponding semantic equation looks 
like 

m[u)oWllUl . . . Wn-lUnWniiyi, ■■■,yk) = C 

where it is assumed that, for every 1 < z < n, u,* is a variable which is 
universally quantified over the set of terminal strings derived from A { . The 
right-hand side C is obtained from C by si similar notational transformation. 
Furthermore, rather than small letters (like m) capital calligraphic letters 
(like C or £) are used to denote semantic functions. In the following example, 
however, we will retain our notational system. 
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An example of a denotational semantics 

In this example we specify the code semantics of a small imperative program- 
ming language (including while statements) as assembly code by using the 
formal model based on denotational semantics. For this purpose we enlarge 
the context-free grammar Go, which generates arithmetic expressions, to the 
context-free grammar Gi. The nonterminals of Gi are the nonterminals E, 
T, F, and I of Go, and additionally 

P (start symbol) for generating programs, 

L for generating lists of statements, 

S for generating statements, 

W for generating while statements, 

A for generating assignment statements. 

The productions of Gi are the productions p\ through pio of Go (see p. 5) 
and additionally 



pii: P L 






P12* L^L\S 


Pl 3 - 


L-^S 


P14: S^W 


Pl 5 - 


S^A 


Pi 6 ^ W — > while F > 0 do L od 






P17: A — y I := E, 







Again it is quite obvious how to associate a natural semantics V\w\ with 
a program w G L{G\): it is the partial function of type such that, 

for every ni, ri2, G N, F[u;](ni, ri2, n^) - (ni, n'2, n^), if 

• the initial values of the program variables a, 6, and c are ni, ri2, and 713, 
respectively, 

• the program w is “executed” , 

• it terminates, and 

• the final values of a, 6, and c are n^, n'2, and rig, respectively. 

Clearly, the notion of “execution” is vague, but it can be substituted by 
“executed on a PC,” or “executed on a sheet of paper,” or “executed by 
hand waving,” or “executed mentally.” Henceforth this semantics is called 
the program semantics of w and it is used as a reference semantics for the 
correctness of the code semantics. 

For every assembly program q (possibly with jump instructions), we de- 
fine the partial function M{qJ : ^ in the following way: for every 

ni,n2,ri3 G N, jM[g](ni, ri2, n3) = (n'j, n'2, n'g) if 

• the initial values of the memory locations with addresses a, 6, and c are 
ni, n2, and n3, respectively, 

• the assembly program q is executed on our extended machine, 

• it terminates, and 

• the final values of the memory locations a, fc, and c are n'^, n^, and ng, 
respectively. 
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Now, for a program w E L{G\)y we want its code semantics to be an 
assembly program ass-prog{w) such that VlwJ = Mlass-prog(w)J, i.e., ass- 
prog{w) is correct with respect to the program semantics of w. 

We specify the code semantics by using denotational semantics as a formal 
model, i.e., the first formal model for the extended abstract method. The 
meaning names we need are code, codei, lengthy temp, and name, where 
code I has one parameter, which keeps the context information, and the other 
meaning names do not have a parameter. The meaning names are associated 
to nonterminals by the mapping a as follows. 

a(P) = {code} 

a{L) = a(5) = a{W) = {codei, length} 
o^{A) = {code, length} 

a{E) = a(T) = a(F) = [code, length, temp} 

a(I) = {temp, name} 

The carrier sets of the meaning names are defined as follows. 

Vcode = I 9 is a sequence of assembly instructions} 

Vcodei = {/ 1 / is a mapping from N to Vcode} 

Vtemp “ Vlength ~ N 

Vname ~ 

We now explain the intuition behind the meaning names. For this, let w 
be a program written in L{G\), let t^ be the derivation tree of w, and d be a 
node of t^ labeled by some nonterminal X E N, Let v be the frontier of the 
subtree rooted by d, thus v is a substring of w derived from X. 

• If X E {P} (i.e., d is the root of t^ and v = w), then the value of code at 
d will be an assembly program ass-prog{w) such that Mlass-prog{w)} = 

PM- 

• If X E {L,S,W}, then the value of code\ at d, (i.e., codei(X)(n)) will 
be an assembly program ass-prog{v) such that Mlass-prog{v)l = V{vJ. 
Moreover, ass-prog{v) is a substring of ass-prog{w) such that the first 
assembly instruction of ass-prog{v) is the n-th assembly instruction of 
ass-prog{w) (Fig. 1.12). 

• If X = A, then the value of code at d will be an assembly program ass- 
prog{v) such that Mlass-prog{v)J = V{v}. 

• If X E {L, S, W, A}, then the value of length at d is the length of the value 
of codei, i.e., the length of the assembly program ass-prog{v), where the 
length is measured in the number of assembly instructions. Note that the 
length of ass-prog{v) does not depend on the context information at d. 

• If X G {E, T, F}, then the value of code at d will be an assembly program 
ass-prog{v) such that Mlass-prog{v)J = £^[v]. 

• If X G {E,T, F}, then the value of length at d is the length of the value 
of code, i.e., the length of the assembly program ass-prog{v). 

• The meaning of temp is the same as in the previous example. 
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• If X = /, then the value of name at d is the terminal which labels the 
immediate successor of d. 



The semantic equations are presented in Fig. 1.13 and 1.14. We note 
that the assembly instruction STOP is used for technical reasons which will 
become clear later. Anyway, the effect of STOP is to stop assembly pro- 
grams. We also note that the expression codei(L)(0) should be read as fol- 
lows: codei{L) is a unary function which is applied to the expression 0. 

Again, the semantic equations are transcribed into recursive function pro- 
cedures. In Fig. 1.15 the recursive function procedure of the meaning name 
codei is shown. 

Finally, we compute the code semantics of a simple program in the lan- 
guage L{Gi). For this, let us consider the program 

while a — c> 0 do 

a := a — c; 
b := b * b 
od 



which we abbreviate by exp. The name exp is chosen because this program 
computes the function F defined as 



F{x,y,z 




yz) if X > z 

{x, y, z) otherwise. 
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Pi : E'o — ► £*1 +T code(£'o) = code{E\) STORE temp{T) 

code{T) ADD temp{T) 

length(Eo) = length(Ei) + length{T) + 2 
temp{Eo) = max{temp{Ei) + temp{T)) + 1 

P2 * Eq — >• E*! — T 

The same equations as for pi except that we write SUB for ADD 
in the definition of code(Eo) and exchange the roles of Ei and T. 

Ps : E T code{E) = code{T) 

length{E) = length{T) 
temp{E) = temp{T) + 1 

P 4 : To — ► Ti ♦ F code(To) = code{Ti) STORE temp(F) 

code{F) MUL temp{F) 
length{To) = length{Ti) + length{F) -f 2 
temp(To) = max{temp{Ti) + iemp{F)) + 1 

Ps : T FjPe : F (E) 

The equations for ps and pe can be obtained from that of pa by 
appropriate substitutions. 



P7 : F I 


code{F) = 


LOAD name{I) 




length(F) = 


1 




temp{F) = 


temp{I) + 1 


P8-. I 


name(7) = 


a 




temp{I) = 


1 



P9 : 1 ^ 6,pio : / — ► c 

The equations for pg and pio can be obtained from that of ps by 
replacing a by 6 and c, respectively. 



Fig. 1.13. The semantic equations for code^ codeia lengthy name, and temp in the 
approach based on denotational semantics 
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pii : P L code{P) 


= codei(L)(0) STOP 


P12 • Fo — > Li]S codei{Lo){y) 


= codei{Lx){y) 

codei{S){y + length{Li)) 


length(Lo) 


= length(Li) + length{S) 


P 13 '■ L ^ S codei{L){y) 


= codei{S){y) 


length{L) 


= length{S) 


PiA-S -*W codei{S){y) 


= codei{W){y) 


length{S) 


= length(W) 


P15 : S ^ A codei{S){y) 


= code{A) 


length(S) 


= length{A) 


P 16 •* W while £* > 0 do L od 


codei{W){y) = 


code{E) 

JNP y -f- length{E) -f- length{L) -f 2 
codei{L){y + length{E) + 1) 

JUMP y 


length{W) = 


length(E) Alength(L) -f 2 


Pi7 : I E code{A) - 


code{E) STORE name{I) 


length{A) = 


length{E) -f 1 



Fig. 1.14. (cont.) The semantic equations for code, codei, length, name, and temp 
in the approach based on denotational semantics 



where xmodz and xdivz are the remainder and the result, respectively, of 
the integer-valued division of x by z. Thus, V\exp\ = F. 

The derivation tree of exp, denoted by texp^ can be seen in Fig, 1.16, where 
the subtrees and ^4 of are defined in Figure 1.17. In both figures 

we have given names to some of the nodes so that we can refer to them. 
Note that the names c ?4 through dg refer to the occurrence of ti in Fig. 1.16. 
We compute the code semantics of exp by applying the recursive function 
procedure code-eval to the root do of texp^ The protocol of the execution of 
code-eval looks as follows. 
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func codei-eval{d: node; yi-par: Vtemp)- Vcode] 

begin 

case production applied at d of 

P12: codei-eval := codei-eval(dl^ Vi-par) 

codei-eval{d 2 , yi-par + length-eval{d 1 )); 
P13: codei-eval := codei~eval{dl, yi-par); 

P14: codei-eval := codei-eval{dl, yi-par); 

Piq: codei-eval := code-eval{d 1 ) 

JNP {yi-par H- length-eval{d 1) + 
length-eval{d 2 ) + 2) 

code\-eval{d 2 ^ yi-par + length-eval(d\) + 1) 
JUMP t/i-par; 

end 

end codei-eval 



Fig. 1.15. The function procedure codei-eval 




Fig. 1.16. The derivation tree texp of the program exp 








1.1 Syntax- Directed Semantics 



21 




code-eval{do) 



code\-eval{di){G) 

STOP 

code\-eval{d 2 ){G) 

STOP 

codei-et;a/(d3)(0) 

STOP 

code-eval{d^) 

JNP length-eval{d^) -h length-eval{dio) -f 2 
code\-eval{d\Q){length'eval{d^) -f 1) 

JUMP 0 
STOP 



Thus we have to execute the function procedure call code-eval{d^): 

code-eval{d^) = code-eval(d^) 

STORE temp-eval{d^) 
code-eval{d^) 

SUB temp-eval{d^)j 
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where code-eval{ds) = code-eval{do) — code-eval{dj) = code-eval{ds) = 
LOAD a. Similarly to code-eval{de)^ we can compute that code-evaI{dg) = 
LOAD c. Next we compute temp-eval{d 5 ) = 1 temp-eval{dQ) = 2 + temp- 
eval{d 7 ) = 3 temp-eval{ds) = 4. Hence we get 

code-eval{d^) = LOAD c 
STORE 4 
LOAD a 
SUB 4 

We now compute length-eval{d^) as follows: length-eval(d^) = length- 
eval{d^) + length-eval{dg) + 2, where length-eval{d^) = length-eval{de) = 
length-eval{d 7 ) = length-eval{ds) = 1 and by similar computations we ob- 
tain length-eval{dg) = 1. Hence length-eval{d 4 ) = 4. Similarly to length- 
eval{d 4 ) = 4, we can compute that length-eval{dio) = 10. Since length- 
eval(d 4 ) = 4, we have codei-eval{dio){length-eval{d 4 )-}-l) = codei-eval{dio)(5). 
Thus, we can compute 

codei-eval{dio){5) = LOAD c 

STORE 4 
LOAD a 
SUB 4 
STORE a 
LOAD 6 
STORE 2 
LOAD 6 
MUL 2 
STORE 6 

Finally, by substituting the computed values in code-eval{do) , we obtain 
code-eval{do) = 



0 


LOAD c 


9 


STORE a 


1 


STORE 4 


10 


LOAD 6 


2 


LOAD a 


11 


STORE 2 


3 


SUB 4 


12 


LOAD 6 


4 


JNP 16 


13 


MUL 2 


5 


LOAD c 


14 


STORE b 


6 


STORE 4 


15 


JUMP 0 


7 


LOAD a 


16 


STOP 


8 


SUB 4 







We note that the labels 0: through 16: are not part of the assembly pro- 
gram: they are just shown to emphasize the targets of the two jump instruc- 
tions. 
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A third formal model: attribute grammars 

In the literature our third formal model for syntax-directed semantics is 
known as attribute grammars. In this model the handling of context informa- 
tion is done explicitly. That is, we may use meaning names in their original 
sense, i.e., without parameters, and there is one principal meaning name mo- 
Additionally, we introduce so-called context names which are used in order 
to transmit information in the derivation tree from a node to its descendants. 
(The other direction of information transmission is managed by the mean- 
ing names.) In this sense, the handling of context information is done in an 
explicit way. 

The set C of context names is disjoint with M {M is again the set of 
meaning names). Also context names are associated with the nonterminals 
of the grammar. We use one mapping a, for both the association of meaning 
names and the association of context names; hence a : N V{M U C). 
Context names also take their values from carrier sets: for every context 
name c, a carrier set Vc is defined. The values of context names are defined 
by semantic equations associated to the productions of the grammar. In fact, 
meaning names and context names may depend on each other, as can be seen 
if we show the form of the semantic equations. For every production 

p:A^ WqAiWi . ..Wn^lAnWn, 

in P and meaning name m G c^{A) fl M , there is a semantic equation of the 
form 

m{A) = C, 

where C is an expression built up from basic functions over carrier sets, com- 
pound objects of the form either rrij{Ai^) or Zj{A) where mj G ck(Aij) fl M, 
I < ij < n, and Zj G ot{A) fl C. We call this the (m, A)-equation of p. Also, 
for every 1 < i < n and for every context name z G a(A,) fl C, there is a 
semantic equation of the form 



z{Ai) = C, 

where C is an expression built up in the same way as in the (m, A)-equation. 
We call this the (^r, A, f)-equation of p. Intuitively, it specifies the value of the 
context name z at the i-th descendant of every node at which p is applied. 

For the computation of the semantics of a particular w G L{G), we again 
transcribe the system Ssem of semantic equations into a system Sfp of re- 
cursive function procedures. More precisely, for every meaning name there is 
one function procedure and for every context name there is also one func- 
tion procedure. Both types of function procedures have one formal parameter 
which may take a node of the derivation tree of u; as a value. There are no 
additional formal parameters. Let m and z be a meaning name and a context 
name, respectively. For a node d and its direct predecessor d' we define the 
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brother number of d to be the position which d has in the sequence of all the 
direct descendants of d'. Then the form of the function procedures is shown 
in Fig. 1.18 where C' is obtained from the right-hand side C of the 
equation by replacing compound objects of the form mj{Aif) and Zj{A) by 
the function calls rrij-eval{dij) and Zj-eval(d), respectively. Moreover, is 
obtained from the right-hand side of the {z, A, i)-equation in the same way 
as C has been obtained from C and additionally d is substituted by d'. 



func m-eval{d: node): Vm ; 
begin 

case production applied at d of 
p: m~eval := 
end 

end m-eval 

func z-eval{d: node): Vz ; 

begin 

let d' be the immediate predecessor of d 

case (production applied at d', brother number of d) of 

(p,i): z-eval := C<; 

end 

end z-eval 



Fig. 1.18. The function procedures m-eval and z-eval 



The semantics of a string w G L{G) is then the value of the meaning 
name mo at the root of w^s derivation tree This value is obtained by 
calling mo-eval at the root of invoking a tree walk over tuj which may 
be very involved. It is even possible that the tree walk will loop, depending 
on the semantic equations. Fortunately, it is possible to decide whether, for 
a given system of semantic equations, there is a derivation tree such that 
the induced tree walk is cyclic. 

We note that in the theory of attribute grammars, meaning names and 
context names are called synthesized attributes and inherited attributes, re- 
spectively. However, we continue to use our terminology. 
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An example of an attribute grammar 

As an example, we now specify the code semantics of programs in L{G\) in 
a syntax-directed way by means of an attribute grammar. It has the four 
meaning names code, temp, lengthy and name and one context name first. 
The carrier sets of code, temp, lengthy and name are the same domains as 
in the previous specification, and the carrier set of first is N. Again, code is 
the principal meaning name. 

The meaning names and context names are associated to the nonterminals 
by the mapping a as follows. 

a(P) = {code} 

a(X) = {code, length, first}, for X E {L, S, W} 

a{A) = {code, length} 

a{I) = {temp, name} 

a(X) = {code, length, temp}, for X E {E, F, T} 

The intuitive idea of the meaning names should be clear by considering 
the previous two example specifications. The context name first corresponds 
to the parameter y in the specification based on denotational semantics. In 
fact, let w E L{Gi) be a program and t«, be the derivation tree of w. Let d be 
a node of t«; labeled by X and let v be the frontier of the subtree of tyj rooted 
by d (Fig. 1.12). Then, clearly, ass~prog{v) is a substring of ass-prog{w). It 
follows that the value of first at d will be the position of the first assembly 
instruction of ass-prog{v) with respect to the beginning of ass-prog{w). 

Next we associate a system Ssem of semantic equations to the productions 
of Gi, see Fig. 1.19 and 1.20. 

In order to illustrate the construction of the system Sjp of recursive func- 
tion procedures, let us consider the context name first. The corresponding 
function procedure is shown in Fig. 1.21. 

We finish this section by computing the code semantics ass-prog{exp) 
for the program exp which was shown in the previous example. We recall 
the schematic form of the derivation tree of exp, denoted by texp^ in Fig. 
1.22. The values of the meaning names and the context name at some nodes 
of t are indicated, although we could not show all values because of space 
limitations. The subtrees ti and <2 of iexp for the arithmetic expressions a — c 
and 6*6, respectively, are detailed in Fig. 1.23, where the values of code and 
length are computed at the roots of t\ and t 2 in the same way as in the 
case of Go. The codes code{a — c) and code{b * b) are then abbreviated by c\ 
and C 2 , respectively, and these abbreviations are used in Figs. 1.22 and 1.23, 
too. Thus the code of exp is the same assembly program as in the previous 
example where we specified the code semantics of L{Gi) using a denotational 
semantics approach. 
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Pi : Eq ^ E\ T ^ P2 '• Eq ^ El — T 

The same equations which were associated to pi and p 2 in Go , in 
Figs. 1.13 and 1.14. Moreover, for both productions the equation 
length{Eo) = length{Ei) + length{T) + 2 is associated. 

ps : E 

The same equations as for ps in Go and additionally the equation 
length{E) = length{T). 

P4 :T * F, p^ :T F, pe : F (E), p7 : F I 

The same semantic equations as associated with the productions 
of Go and additionally the equations for length of the nonterminal 
in the left-hand side. Note that length{F) = 1 for py. 

Ps : I a 

The same semantic equations as associated with ps in Go- 
Pd : I pio : I c 

The same semantic equations as associated with ps except that a 
is substituted by b and c, respectively. 



Pii : P —y L code{P) 


= 


code{L) STOP 


first{L) 


= ' 


0 


Pi 2 : Lq ^ Li; S code{Lo) 


— 


code{Li) code{S) 


length{Lo) 


=z 


length(Li) + length(S) 


first(Li) 


= 


first(Lo) 


first{S) 


= 


first(Lo) + length{Li) 


Pi 3 : L S code{L) 




code{S) 


length{L) 




length(S) 


first{S) 


= 


first(L) 



Fig. 1.19. The semantic equations for code, first, and length in the framework of 
attribute grammars 
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PMiS^W 

The same as for pi 3 except that L and S are substituted by S 
and W. 

Pi5 • S — A 

Similarly as for pi 4 with the exception that there is no semantic 
equation for first. 

Pi6 • W while £* > 0 do L od 

code(W) = code{E) JNP first{W) + length{E)-\- 
length(L) + 2 
code{L) JUMP first{W) 
length{W) = length{E) + length{L) + 2 
first{L) = first{W) -h length(E) -f 1 

Pi 7 : A I := E code{A) = code{E) STORE name{I) 
length{A) = lengih(E) + 1 



Fig. 1.20. (cont.) The semantic equations for code, firsts and length in the frame- 
work of attribute grammars 



func first-evQ 


,l{d: node): Vjirst] 




begin let d' be the immediate predecessor of d 


case (production applied 


at d', brother number of d) of 


(pii 


, 1): first-eval := 


0; 


(Pl2 


, 1): first-eval := 


first-eval{d'); 


(Pl2 


,2): first-eval := 


first-eval{d^) + 






length-eval{d' 1); 


(Pl3 


, 1): first-eval := 


first-eval{d^)\ 


(Pl4 


, 1): first-eval := 


first-eval{d’)\ 


(Pl6 


,2): first-eval := 


first-eval{d') -f 






length-eval{d' 1) + 1 


end 






end first-eval 





Fig. 1.21. The function procedure first-eval 
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Fig. 1.22. The derivation tree texp with the computed values of the meaning names 
and the context name at some nodes 
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Fig. 1.23. The subtrees ti and <2 of texp with the computed values of code and 
length at their roots 
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The involved machine-oriented computation paradigm 

Up to now we have specified the intended semantics of a context-free language 
by means of three formal models. Each of these models consists of a system 
Ssem of semantic equations, or better: a system Sfp of recursive function 
procedures and a computation paradigm which can organize and maintain 
the sequence of calls of recursive function procedures. 

Intuitively, it is clear how this calling process looks, and probably every 
first-year student of computer science will learn the effect of a call of a re- 
cursive function procedure. This is exactly the reason we have chosen this 
computation paradigm to give a semantics to the specification. 

On the other hand, an abstract machinery which maintains recursive func- 
tion procedure calls automatically is not so trivial to define, and consequently 
it is not likely that this is taught in detail to first-year students. The most 
common relevant machinery consists of a runtime stack machine M runtime 
and a compiler comp. The machine Mruntime mainly consists of a stack which 
keeps track of the incarnations of the function procedures and the values of 
the parameters using static and dynamic links. The compiler comp trans- 
lates the system Sjp into an assembly program ass{Sfp) for Mruntime- Thus 
the scenario of Fig. 1.4 has to be refined, and the more detailed situation is 
shown in Fig. 1.24. Later in our discussion we will return to the role of the 
computation paradigm. 



specification 
by Ssem 



transcription 



system 

Sfp 



comp 



(5/p) 



parser 



string 



sds evaluator 



derivation 

tree 




value 



Fig. 1.24. The sds evaluator based on Mruntime and on comp 
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1.2 Tree Transducers 

The aim of this book is to develop a theory of formal models of syntax- 
directed semantics. This theory contains abstract properties of such models. 
In particular, the properties do not depend on certain assumptions and de- 
tails which are inherent in the three models of the previous subsections, and 
therefore we will abstract from them. More precisely, 

• we abstract from derivation trees and use trees over some ranked alphabet 
instead, 

• we abstract from the semantic domain and use the initial term algebra 
instead, and 

• we abstract from the machine-oriented computation paradigm and use a 
rule-based derivation system instead. 

This abstraction process produces formal models of syntax-directed semantics 
which we call tree transducers. In the following subsection we will explain the 
three abstraction steps in more detail. 

Abstracting from derivation trees 

Recall that, with every system 5,em of semantic equations, we have associ- 
ated a system Sjp of recursive function procedures. Every function procedure 
has at least one formal parameter d which represents a node of the given 
derivation tree t^} (e.g., the function procedure code\-eval in Fig. 1.15). The 
involved case statement represents a case analysis which depends on the pro- 
duction p applied at d. Thus, to be precise, we are not dealing with derivation 
trees but with abstract syntax trees. In such trees the nodes are labeled by 
productions of the underlying context-free grammar, and a node with label 
p has as many descendants as there are occurrences of nonterminals in the 
right-hand side of p. For instance. Fig. 1.25 shows the abstract syntax tree 
which corresponds to the derivation tree t^xp of exp in Fig. 1.16; ^3 and t'^ 
are the abstract syntax trees of ^3 and t^ of Fig. 1.17. 

Thus we should slightly adapt the scenario of Fig. 1.24 to catch the fact 
that the function procedures are working on abstract syntax trees (Fig. 1.26). 

The function procedures also change slightly. Rather than testing the 
production applied at a node d, it tests the label of d. For instance, the 
adapted function procedure code\~eval is shown in Fig. 1.27. 

In Fig. 1.26 let us now consider the border between the parser and the 
syntax-directed semantics evaluator. The parser produces an abstract syntax 
tree t as output, and t is taken as input for the syntax-directed semantics 
evaluator. Clearly, the syntax-directed semantics evaluator does not have to 
check again whether the tree it receives from the parser is an abstract syntax 
tree or not, and hence this membership test is not an intrinsic part of the 
syntax-directed semantics evaluator. And since we want to study the abstract 
properties of formal models which can be used as syntax-directed semantics 
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Fig. 1.25. The abstract syntax tree of exp 



evaluators, we consider from now on only those formal models which accept 
arbitrary trees, i.e., trees over an arbitrary ranked alphabet as input. The 
syntax-directed semantics evaluator then becomes a tree~to~value evaluator 
which takes a tree as input and computes a value. The resulting scenario is 
shown in Fig. 1.28. In this sense we have abstracted from derivation trees. 

In fact, the form of our function procedures does not even have to change. 
We just consider a production as a symbol which has a particular arity or 
rank. Note that there are no longer any sorts requiring that productions 
should fit in the usual way when they are glued together. 

Abstracting from the semantic domain 

Our investigations about formal models should also not depend on particular 
properties of the chosen semantic domain. For instance, we do not want 
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Fig. 1.26. The sds evaluator working on abstract syntax trees 



func code\-eval{d: node; y\-par: Vtemp)- Vcode\ 

begin 



case label of d of 




Pl2- 


codei-eval 


:= code\-eval(d\^yi-par) 






code\-eval{d 2, y\-par -f length-eval{d 1)); 


Pl3‘ 


codei-eval 


= codei-eval{dl^y\-par); 


P14-. 


codei-eval 


= code 1 -e va/(d 1, j/i -par); 


Pie: 


code\-eval 


= code-eval{d 1) 



JNP {y\-par -f lengih-eval{d 1)+ 
lengih~eval{d2) + 2) 

code\-eval{d 2, y\-par 4- length-eval{d 1) + 1) 
JUMP yi-par] 

end 

end codei-eval 



Fig. 1.27. The function procedure codei-eval working on abstract syntax trees 



to deal with algebraic laws like associativity or commutativity, or with the 
question concerning whether the whole arithmetic on natural numbers can 
be performed in this model. Furthermore, we would like to compare different 
formal models, and the comparison should not depend on their semantic 
domain. Therefore we abstract from the semantic domain in the same way as 
in the theory of program schemes, and we choose the initial term algebra as 
the semantic domain. The tree-to- value evaluator then changes into a tree~io~ 
tree evaluator which, computes a tree. This is in fact not a restriction because, 
for every semantic domain £>, there is a unique homomorphism which maps 
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Fig. 1.28. The tree-to- value evaluator 



trees over operation symbols of D to values of D. The homomorphism, so to 
speak, interprets the tree. The resulting scenario is shown in Fig. 1.29. 



specification 
by Ssem ' 



transcription 



system 

5/p 



comp 
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Fig. 1.29. The tree-to-tree evaluator 



Now the function procedures have to change again slightly, because the 
right-hand sides of the case clauses are trees. We show an example in Fig. 
1.30 which is derived from the function procedure in Fig. 1.27. We use the 
dot as a binary symbol which concatenates an assembly instruction with 
a list of assembly instructions. Note also -h occurs in a prefix form. 
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func codei-eval{d: node; yi-par: Vtemp)- Vcode] 

begin 

case label of d of 

P12: codei-eval := •{codei-eval(dl,yi-par)^ 

codei-eval{d 2, +(j/i-par, lenght-eval{d 1)))); 

P13: code\-eval := codei-eval{dl, yi-par); 

P14: codei-eval := codei-eval{dl, yi-par); 

Pie: codei-eval := {code-eval{dl), 

(JNP +(2/1 -par, -\-{length-eval{d 1 ), + 
{length-evai{d 2 ), 2 ))), 

•(codei-eva/(d 2, -f-(yi -par, -\-{length-eval{d 1), 1))), 
JUMP 2/i-par))); 

end 

end codei-eval 



Fig. 1.30. The function procedure codei-eval working on arbitrary trees and pro- 
ducing trees 



Abstracting from the machine-oriented computation paradigm 

As discussed in the last subsection of Section 1.1, we have assumed a machine- 
oriented computation paradigm. We realize that, in order to let any abstract 
machinery calculate the semantic value of some string w, we first have to 
define an abstract runtime stack machine Mruntime which keeps track of the 
incarnations of the function procedures, and a compiler comp which trans- 
lates a system Sfp of recursive function procedures into an assembly program 
ass{Sfp) for the machine Mruntime- But again, we are not interested in imple- 
mentation issues and particular techniques for handling recursive procedures 
on a machine. Hence, we try to replace the machine-oriented computation 
paradigm by a more abstract computation paradigm. 

Here we have chosen the framework of a derivation system (or: abstract 
reduction system). In general, a derivation system consists of a set A of 
intermediate results and a binary relation =>C A x A; this relation is called 
a derivation relation. It defines how one intermediate result is transformed 
to another intermediate result which is somehow “closer” to the final result. 
The relation => does not have to be deterministic, confluent, or Noetherian. 
Thus, our scenario changes again and the tree-to-tree evaluator is replaced 
by a iree-to-tree transducer (Fig, 1 . 31 ). In this book, tree-to-tree transducers 
are referred to as just tree transducers. 

In our context the derivation relation => is based on a set R of rules and 
thus is denoted by =>r. The rules themselves are obtained by an easy tran- 
scription of the semantic equations. To illustrate this, reconsider the function 
procedure codei-eval (Fig. 1 . 30 ). The body of codei-eval is a case statement 
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Fig. 1.31. The tree-to-tree transducer 



in which the transcription of the ( code i, X)-equat ions of some production p 
is collected. Now we dissolve this case statement again and, for every clause 
(i.e., semantic equation), we construct a rule. In particular, for code\-eval we 
obtain the rules which are shown in Fig. 1.32. 



codei{pi2(xi,X2),yi) 

codei{pis{xi),yi) 

codei{pi4{xi),yi) 

codei{pie{xi,X2),yi) 



•{codet{xt,yi), 

codei{x 2 , +(yi , length{xi)))) 

codei{xi,yi) 

codei(xi,yi) 

■{code{xi), 

(JNP+(j/i , +{length{xi), +(length{x 2 ), 2))), 
JUMPyi))) 



Fig. 1.32. The rules for codei 



The result of the three abstraction steps 

If we apply the abstraction steps of the three previous subsections to the 
formal models which were considered in Section 1.1, i.e., to 

• attribute grammars with synthesized attributes only, 

• denotational semantics (of a particular type, as mentioned), and 

• attribute grammars, 
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then we obtain corresponding tree transducers: 

• top-down tree transducers, 

• macro tree transducers, and 

• attributed tree transducers, 

respectively. Figure 1.33 gives an intuitive idea about the relationship between 
these tree transducer types. That is, starting from top-down tree transduc- 
ers and adding parameters (i.e., implicit handling of context), macro tree 
transducers are obtained, whereas adding inherited attributes (i.e., explicit 
handling of context) attributed tree transducers are obtained. Adding both, 
parameters and inherited attributes to top-down tree transducers, leads to 
macro attributed tree transducers. In fact, these four types of tree transducers 
are the subject of the investigations in this book; they are 

formal models of syntax- directed semantics. 

In the following let TOP, MAC, ATT, and MAT denote the classes of 
tree-to-tree transformations (for short: tree transformations) which are in- 
duced (or computed) by top-down tree transducers, macro tree transducers, 
attributed tree transducers, and macro attributed tree transducers, respec- 
tively. 




Fig. 1.33. The formal models considered and their syntactic relationship 



1.3 Theory of Compositions of Classes 
of Tree Transformations 

In the previous two sections we have concluded that tree transducers are 
formal models of syntax-directed semantics. In other words, every type of tree 
transducer can be considered as a specification language for syntax-directed 
semantics. 
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As usual in software engineering, it is desirable that a specification lan- 
guage is compositional in the sense that, if the specification is split up into 
several phases, the specification language should support the separate pro- 
gramming and testing of the phases. Moreover, it should be easy to integrate 
the phase specifications into one specification. 

Formulating this in terms of tree transducers, we obtain the situation 
shown in the upper part of Fig. 1.34: the desired specification of syntax- 
directed semantics is composed of n subspecifications, and the z-th subspeci- 
fication is written in the specification language Ui where Ui is a type of tree 
transducer. Then, the output of the z-th subspecification is the input of the 
(z H- l)-th subspecification. In other words, the upper part of Fig. 1.34 shows 
the functional composition of n subspecifications. 

In this setting, it is advantageous to know whether the composition of two 
consecutive subspecifications which are written as tree transducers of type Ui 
and Ui^i , respectively, can be specified by one tree transducer of some type 
U (see the lower part of Fig. 1.34). We support this observation with four 
reasons: 

• Because of space restrictions, it might be desirable to avoid the compu- 
tation and maintenance of the intermediate result after the z-th subspec- 
ification. In this respect, the knowledge about the composability of two 
consecutive subspecifications has been beneficially integrated, e.g., in the 
tool for compiler specification given in [GGV86] (also cf. [Gie88]). 

• From the complexity of the subspecifications, one might wish to deduce the 
complexity of the whole specification. Here the complexity is measured by 
the membership of a specification in a particular class of tree transducers. 

• Assume that there are program transformations available (like fold and un- 
fold or the elimination of recursion) which are applicable to tree transducers 
of type U . Then one might wish to compose two consecutive subspecifica- 
tions into one specification written in J7, and to apply these transformations 
to the resulting specification and test the result for its efficiency. Such a 
program transformation has been successfully investigated for macro tree 
transducers in [MV95a, MV95b]. There, an automatic transformation strat- 
egy is defined which yields, for every given macro tree transducer M, a tree 
transducer M'. It has been proved that (1) M' is at least as efficient as M, 
(2) there is an algorithm which decides whether M' is more efficient than 
M, and (3) there are infinitely many macro tree transducers for which this 
transformation yields a more efficient tree transducer. 

• There are very efficient implementations of tree transducers of type U . We 
refer the reader to [ERS80], [EV86], [Fil83] for machine characterizations 
of top-down tree transducers, macro tree transducers, and attributed tree 
transducers, respectively. In a certain abstract sense, these machines are 
the most efficient implementations, because their data structure cannot be 
simplified without losing the possibility of implementing every transducer 
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of the particular type. Now, if one knows that the composition of subspec- 
ifications of Ui and Ui^i yields a specification in U y then one can use the 
machine corresponding to the class U . 




Fig. 1.34. Specification of syntax-directed semantics in n phases by tree transduc- 
ers of type U\y. .. yUiy Ui^i , . . . 17n, respectively 



These reasons motivate the study of the composition of tree transfor- 
mations which are elements of the classes TOP, MACy ATT and MAT. 
Typically our investigations concern the comparison of two classes 

U\o . . .oUm and Vi o . . . o 

of tree transformations where m,n > 1 , J 7 i,..., Um) Vi, . . . , are elements 
of the set {TOP, MACy ATT, MAT}, and o denotes the composition of tree 
transformations in the usual sense: if ri and T2 are tree transformations, then 
T1OT2 is a tree transformation such that for every tree ty {T\OT2){t) = T2(ti(^)); 
this composition of tree transformations is extended in an obvious way to 
classes of tree transformations. Thus, U =.U\o . . ,0 Um and V = Vi o . . . o 
are two classes of tree transformations. 

With respect to the comparison of U and Vy one of the following four 
cases is true: 

• U =Vy 
^ UCVy 

• UDVyOV 

• U and V are incomparable. 

If U CK, m=l,n>l, and the K’s are “small” classes, then we quote 
this comparison as a decomposition resuliy because it shows how transducers 
of type U can be decomposed into transducers of simpler types. If U C Vy 
m > 1 , n = 1 , and the J 7 ,’s are “large” classes, then we quote this compari- 
son as a composition result. Clearly, in the case of equality we talk about a 
characterization result. 
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We establish certain techniques based on height properties and path lan- 
guages to test whether V — U ^0. This inequality together with U C V 
yields U CV, 

We consider two particular sets Hi and i ?2 of classes of tree trans- 
formations; more precisely, Hi = {TOP, l-TOP, HOM) where l-TOP and 
HOM are the classes of tree transformations induced by linear top-down 
tree transducers and homomorphism tree transducers, respectively, and H 2 = 
{TOP, ATT, MAC}. For each of these sets we develop an algorithm which 
decides, for every two classes U = C/i o . . . o Um and V = Vi o .. .oVn of tree 
transformations such that Ui,. . t/m, Vi, . . .,Vn E Hi (for i = 1 or z = 2), 
which of the four cases mentioned above is true. In other words, for every 
i G {1,2}, we examine the semigroup generated by Hi with composition as 
the operation. We call this the composition semigroup based on Hi. 

The shape of the developed algorithms is by no means restricted to the 
two particular base sets Hi and H^. It has been successfully applied to other 
such sets, cf. Sect. 3.8. 



1.4 Other Formal Models 

The previous sections indicate the formal models which we are concerned 
with in this book. Certainly, there are also other formal models which can be 
quoted as being syntax-directed. They can be classified according to the way 
in which they handle the input tree, i.e., the tree which directs the definition 
of the semantics. There are 

• top-down models; here the input trees are processed by starting at the 
root and proceeding towards the leaves; e.g., generalized syntax-directed 
translation schemes [AU69a, AU69b, AU71], pushdown tree transduc- 
ers [Gue83], graph translation schemes [Son87], high-level tree transduc- 
ers [EV88], modular tree transducers [EV91], downwards accumulation 
[Gib91, Gib93], catamorphisms [Bir86, Mee86, Fok92], tree transducers 
with external functions [FHVV93], and top-down tree-to-graph transducers 
[EV94]; 

• bottom-up models; here the input trees are treated by starting at the leaves 
and proceeding towards the root; e.g., bottom-up tree transducers [Eng75], 
tree pushdown transducers [SG85], upwards accumulation [Gib91, Gib93], 
anamorphisms [Bir86, Mee86, Fok92], and bottom-up tree-to-graph trans- 
ducers [EV96]; and 

• tree-walking models; here the input trees are treated by starting at the 
root and by performing an arbitrary tree walk on them; e.g., pushdown 
assembler machines [LRS74], checking-tree pushdown transducers [ERS80], 
higher order attribute grammars [KSV89], and attributed nested stack tree 
transducers [FV92a]. 
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With respect to this classification, top-down tree transducers and macro 
tree transducers are top-down models, and attributed tree transducers and 
macro attributed tree transducers are tree-walking models. 



1.5 Outline of the Book 

This book contains eight chapters, a list of figures, a bibliography, and an 
index. The last section of every chapter is devoted to bibliographic notes 
about the topics treated in that chapter. 

After this introductory chapter, we collect in Chap. 2 all the basic notions 
and notations which are used in this book. Chapter 2 can be skipped on first 
reading. 

In Chaps. 3, 4, 5, and 7 we develop the concepts of top-down tree trans- 
ducers, macro tree transducers, attributed tree transducers, and macro at- 
tributed tree transducers, respectively. For every type of tree transducer the 
corresponding chapter has at least the following sections (not necessarily in 
this order): 

• a section with basic definitions in which the tree transducer model is de- 
fined, 

• a section about the induced tree transformation in which the derivation re- 
lation and the tree transformation computed by this type of tree transducer 
is defined, 

• a section about an inductive characterization of the tree transformation, 

• a section stating height properties in which the relationship between the 
heights of input trees and corresponding output trees is investigated, and 

• a section with composition and decomposition results which involves the 
class of tree transformations computed by this type of tree transducer. 

Additionally, Chapter 3 contains sections about different subclasses of 
TOP, an inclusion diagram of these subclasses, and a composition semigroup 
generated by the set of these subclasses. 

Chapter 6 contains a detailed comparison of macro tree transducers and 
attributed tree transducers including the discussion about various restric- 
tions. Moreover, it contains an inclusion diagram of the classes TOP, MAC, 
ATT, and the subclasses computed by the restrictions. At the end of Chap. 
6 we investigate the composition semigroup generated by TOP, MAC, and 
ATT, 

Finally, Chap. 8 gives two more complicated examples which are moti- 
vated by practical problems. The first example deals with the contextual 
analysis in block-structured programming languages, the second shows how 
insertion in 2-3 trees can be specified by means of a macro attributed tree 
transducer. 
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1.6 Bibliographic Notes 

The concept of syntax-directed semantics goes back to [Iro61]. The develop- 
ment of Sects. 1.1 - 1.3 has been deeply influenced by [EngSl]. The algorithms 
for computing the values of meaning names and context names are taken from 
[Eng84]. The idea for our code semantics example is due to [AU73]. 

Attribute grammars were first introduced in [Knu68] and then studied 
intensively, for surveys cf., e.g., [DJL88, DJ90, AM91, Sch95, KV97, Paa95]. 

The concept of denotational semantics is due to [Str66, SS71]. It was 
further established in [Sto77]. Also cf. [Gor79, Ten91, Wat91, Win93, GM96]. 
For a survey paper see [Mos90]. 

The technique of keeping track of and maintaining calls of recursive func- 
tion procedures is a subject treated in compiler theory, cf., e.g., [WG84, 
ASU86, Kas90, WM97]. 

For surveys and books about the theory of program schemes cf., e.g., 
[Pat72, Man73, CM73, Man74, Eng74, Gre75, Cou90, Nol95]. 

In [Hue80], several important, abstract properties of derivation systems 
(there called: abstract reduction systems) are proved. 

For the origins of top-down tree transducers, macro tree transducers, at- 
tributed tree transducers, and macro attributed tree transducers we refer the 
reader to the bibliographic notes in Chaps. 3, 4, 5, and 7, respectively. Here 
we only mention the survey works [GS84, GS97]. 

For articles about program transformations, we refer the reader to [BD77, 
PP86, Par90]. 

For the theory of compositions of tree transformations, we also refer to 
[Kiih97] in which several subclasses of MAC and ATT have been investigated 
from this point of view. In particular, subclasses based on the “syntactic 
single used requirement” of Ganzinger and Giegerich [Gan83, GG84, Gie88] 
are studied. 




2. Basic Notions and Notations 



2.1 Sets and Relations 

We denote by N the set of nonnegative integers. 

Let A and B be two sets. We denote by card(A) the cardinality of -4, by 
P(A) the power set of A, by A C B that A is a subset of B, by A C B that A 
is a proper subset of B, and by A ^ B that A is not a subset of B. Moreover, 
A K B stands for the fact that A and B are incomparable with respect to 
inclusion, i.e., that neither AC B nor B C A holds. 

Given two sets A and B, any subset 0 of the Cartesian product Ax B 
is called a relation from A to B. We write a0b to mean that (a, 6) G 6- The 
inverse of 0 is the relation 0~^ = {(6, a) | (a, 6) G 0} from B to A. 

Let 0 C ^ X B be a relation. If for every a G -4, there is exactly one b E B 
such that 006, then 0 is called a function or alternatively a mapping from A 
to B. The fact that 0 is a function from A to B is denoted by 0 : >1 B. If 
0 is a function and a0b^ then we also write 0(a) = 6. Moreover, for B C A we 
put 

0(B) = {0(6) |6gB}. 

The range of a function 0 is denoted by range{0) and it is defined by the 
equation range{0) = 0{A). 

We now introduce the notion of composition for relations. Let 0 be a rela- 
tion from A to B and let r be a relation from B to C. Then the composition 
of 0 and r is the relation 0 or from A to C defined by 

0 o r = { (a, c) I a0b and brc for some 6 G B }. 

Note that if 0 and r are functions, then (0 o r)(a) = r(0(a)). 

We extend the concept of composition to classes of relations. If Y and Z are 
classes of relations, then 

YoZ = {0or\0eY a^ndreZ}, 

Moreover, we define the n-th power of y as follows. Let Y^ =Y and for any 
n>i,iet y” = yoy«-i. 

A relation from A to A is also called a relation over A. Let 0 be a relation 
over A. We sav that 0 is 
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- reflexive if, for every a E Ay it holds that aOa; 

- symmetric if, for every Oyb E A, the condition aOb implies bOa; 

- transitive if, for every aybyCE Ay the conditions aOb and bOc imply aOc. 

If 6 is reflexive, symmetric, and transitive, then it is an equivalence relation 
over A. For an element a E Ay the equivalence class containing a is the set 
a/0 = {b E A \ a0b}. If 0 is an equivalence relation over Ay then a. set N C A 
is said to be a set of representatives for 0 if, for every a E Ay there exists 
exactly one b E N such that a0b. 

The identical relation {(a, a)\aE A} over A is denoted by Id{A), 

The n-fold composition of 0y denoted by 0^^y is deflned by 0^ = Id{A) 
and 0^ = 0 o 0^~^y if n > 0. We introduce the notion 0* = Un>o^”* 
relation 0* is called the reflexive, transitive closure of 0y meaning that it is 
the smallest reflexive and transitive relation that contains 0. 



2.2 Partial Orders 

A relation 0 over a set A is antisymmetric if, for every a, 6 G A, the conditions 
a0b and b0a imply a = b. A reflexive, antisymmetric, and transitive relation 
is called a partial order. A set A with a partial order 0 over it is called a 
partially ordered set. Let A be a partially ordered set with the partial order 
0. If an element a G A is such that, for every b E Ay the property b0a implies 
a = by then a is called a minimal element of A. Moreover, two elements a 
and 6 of A are called incomparable if neither a0b nor b0a holds for them. 

Inclusion diagrams 

Any flnite partially ordered set has the nice property that it can be visualized 
by drawing its Basse diagram. The Basse diagram is in fact a graph presented 
as follows. 

Let A be a flnite set with the partial order < on it. Let us represent every 
element a of A by a node which is labeled by a. We will call this node a-node. 
Draw the nodes representing the elements of A in such a way that whenever 
a < b holds for two different elements a and 6 of A, the 6-node is drawn above 
the a-node. Moreover, for every a, 6 G A, connect the a-node and the 6-node 
by an edge if a and 6 satisfy the conditions that a < 6 and, for every c G A, 
a < c <b implies that a = c or 6 = c. From this diagram we can reconstruct 
< by noting that a < b holds if and only if a = 6 or the 6-node is above 
the a-node and we can go from the 6-node to the a-node via a sequence of 
descending edges. 

In general, infinite partially ordered sets cannot be visualized so obviously. 
However those appearing in this book will be so special that one should be 
able to reconstruct them by having a flnite piece of their Basse diagram. 
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We will especially consider partially ordered sets of which the elements are 
classes and the partial order is the inclusion relation C. The Basse diagram of 
such a partially ordered set will be called the inclusion diagram. So whenever 
we say inclusion diagram, it should be clear that we mean the Basse diagram 
of a set H of classes, with the inclusion as a partial order over H. 

To give an example, let and D be classes and let C be the relation 

{(JB, A), {B, C), (D, C)}. Then the inclusion diagram is shown is Fig. 2.1. 




We now present a method concerning inclusion diagrams which will be 
used later in the book. Assume that there is a finite set H of classes and that 
the elements of H are known to us. We would like to construct the inclusion 
diagram of H. Bowever, sometimes we are not sure about all inclusion rela- 
tions among the elements of if. In such a case, we can do the following. We 
make a conjecture about the inclusion diagram and we try to prove that the 
conjectured diagram is the inclusion diagram of if . By a conjectured diagram 
we mean a visualized graph whose nodes are labeled by elements of if and 
there may be edges between the nodes such that an edge can connect two 
nodes only if one of them is above the other. The proof of the claim that it 
is the inclusion diagram of if is in fact the proof of the claim that the inclu- 
sion relations are those and only those which are shown by the conjectured 
diagram. It should be clear that this implies the proof of the following three 
statements for every A,B £ H. 

• If the A-node and the B-node are situated in the diagram such that the 
B-node is above the A-node and there is an edge from the B-node to the 
A-node, then the inclusion AC B and the inequality B — A ^ ^ hold. 

• If the A-node and the B-node are the same, (i.e., A and B are conjectured 
to be equal), then AC B and B C A hold. 

• If there is no descending path between the A-node and the B-node (i.e., A 
and B are guessed to be incomparable), then the inequalities A — B ^ ^ 
and B — A^^ hold. 

Bence, given a conjecture for the inclusion diagram of /f , we can construct 
a finite number of inclusions and inequalities whose validity is necessary and 
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sufficient to verify that our conjecture is the inclusion diagram of H. However, 
this finite number may be large even if the size of the conjectured diagram is 
relatively small. Therefore it would be nice to reduce this number. Unfortu- 
nately nothing can be done about the number of inclusions because we should 
verify all inclusions. However, the number of inequalities can sometimes be 
reduced using the observation presented below. 

With the following procedure, we can construct the set INCL of for- 
mal inclusions and the set INEQ of formal inequalities from the conjectured 
diagram where “formal” means that the inclusions and inequalities are con- 
sidered as formulas which are either true or false in H, 

For every A,B £ H, 

• If the i4-node and the H-node are situated in the diagram such that the 

H-node is above the A-node and there is an edge from the H-node to the 
A-node, then let A C H G INCL and B — ^ INEQ. 

• If the A-node is the same as the JB-node, then let A C B E INCL and 
BCAe INCL. 

• If there is no descending path between the A-node and the JB-node in the 
diagram, then let A - H ^ 0 G INEQ and B — A^^ E INEQ. 

As we mentioned above, the conjectured diagram is the inclusion diagram 
of H if and only if all formulas in INCL and INEQ are valid. 

Now assume that we could verify all formulas in INCL. We observe that 
the number of formal inequalities to be verified may be reduced. If we can 
verify A — H ^ 0 G INEQ and there is another formula A' — ^ 0 G INEQ 

such that A C A! and B' C B hold due to previously verified inclusion 
relations in INCL, then A' — ^ 0 is also verified. 

Hence we can define the relation < over the set INEQ such that 

(A-H^0) < (A'~H'^0) (*) 

if A C A' and B' C B hold. We can easily see that < is a partial order 
over INEQ. Moreover, by the above reasoning, for any formal inequalities 
g, E INEQ, lig < g' and g is valid, then g' is also valid. Thus it is sufficient 
to prove the formal inequalities which are minimal with respect to <. 

Hence, we can construct the inclusion diagram of a finite set H of classes 
with the partial order C applying the following method. 

Algorithm 2.1. The construction of the inclusion diagram of a finite set H 
of classes. 

1. Conjecture the inclusion diagram of H. 

2. Construct the sets of formal inclusions INCL and formal inequalities 
INEQ from the conjectured diagram applying the three steps described 
above. 

3. Verify that all formulas in INCL hold. 
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4. Construct the set MINEQ of minimal elements of INEQ with respect 
to <. (The construction of MINEQ can be made by simple manual 
calculation or, when there are too many formulas in INEQ, by invoking 
a standard algorithm which computes the minimal elements of a finite 
partially ordered set.) 

5. Verify that all formulas in MINEQ hold. □ 

Let us apply this algorithm to an example and assume that Fig. 2.1 shows 
a conjectured diagram. Then IN CL contains the following formal inclusions: 

BCA, BCC, Dec 

and the set INEQ contains the following nine formal inequalities: 

A-B^:^ C-B^H^ C-Dz^^ 

B-D^ib D-B^(H A-Di^^ 

D-Aii^ A-Ci^a C-A^m 

Then, for example, D — A-^^<C — A-^% and C — A^^<C — B-:^^. 
Next, we can compute the set MINEQ which consists only of the following 
five formal inequalities: 

A-BiL{li C-D^Hi B-D:^^ 

D-A^^ A-C^m 

Topological sorting 

A partial order < on a set A is called linear order if in addition, for every 
a,b E A, a < b OT b < a holds. A set with a linear order on it is called linearly 
ordered set. 

Clearly, if A is a finite set, then every linear order </ on A can be repre- 
sented as a sequence fi = 0102 . . . On such that A = {ui, U 2 j • • • , and, for 
every i with 1 < i < n — 1, a,* </ a,+i. 

Every partial order < on A can be embedded into a linear order </ such 
that < is a subset of </. The algorithm which constructs such a </ is called 
topological sorting. 

Algorithm 2.2. Topological Sorting 

Input: a finite set A and a partial order < on A 

Output: a linear order <\ on A such that < is a subset of <i 

Method: The algorithm needs the following variables: 

T: subset of A x A 

T\ : a sequence of elements of A 

The algorithm works as follows: 
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T:= <; 

Ti := the empty sequence; 
while T ^ 0 do 

choose an element 6 E -A such that there is no a E A 
with a^h and (a, 6) E T; 

Ti := T, . 6; 

r:=r-{(6,c)|(6,c)€r} 

end : 



Due to the choice in the body of its while-statement, the topological 
sorting is a nondeterministic algorithm. This nondeterminism reflects the 
fact that, for a partial order <, there may be more than one linear order </ 
such that < is a subset of </; the algorithm constructs one such linear order. 

In this book, a frequent instance of the linearly ordered set is the hierarchy. 
A hierarchy is a family of classes {Ca- | Ar > 0} such that Ck C Cfc+i, for 
every Ar > 0. A hierarchy is said to be proper if every inclusion is proper, i.e., 
Ck C Cib+i, for every A: > 0. It should be clear that the inclusion diagram of 
a hierarchy is a vertical chain. 



2.3 Directed Graphs 

In the book we will also work with directed graphs. A directed graph is a pair 
D = (V, E)^ where F is a set, called the set of nodes, and E QV xV is the 
set of edges. If (a, b) E £*, then we say that there is an edge from a to 6 in 
the graph. (Note that edges are unlabeled and there are no multiple edges 
between nodes.) A path in the graph is a sequence ai, . . . , Un of nodes such 
that, for every 1 < i < n, there is an edge from a,* to a,*+i, i.e., (a,-, a,+i) E E. 
Moreover, a cycle in the graph is a path with a\ = an- 



2.4 Derivation Systems 

A derivation system (or reduction system, see for example [HueSO], [Boo83], 
and [Klo92]) is a system {A, =>), where is a binary relation over A called 
a derivation relation (or reduction relation). 

For an arbitrary a E A, a derivation starting from a is a sequence 
ai, . . . , On G A for some n > 1 such that a = ai and, for every 1 < i < n — 1, 
Oi => The length of this derivation is n ~ 1. Clearly, in general there 
may be more than one derivation starting from a particular element a. A 
derivation is a derivation starting from some a E A. 
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An element a E A is irreducible (with respect to =^) if there is no 6 G A 
such that a b. If for some a E A there is exactly one b such that a =>* b 
and b is irreducible, then 6 is called the normal form of a, and it is denoted 
by n/(=>,a). We will show that a has this property shortly by saying that 
nf(=^y a) exists. 

We put 4=>= (=> U We note that is an equivalence relation over 

A. 

We say that 

(a) =» is terminating on A if there is no infinite sequence of the form 

ai 02 . . . with oi, 02 , ... G A; 

(b) is locally confluent on A if, for every a,6, c G A, the derivations 
a=^ b and a c imply that b =>* d and c d for some d G A; 

(c) is confluent on A if, for every a^b^c E A, the derivations a 6 
and a =^* c imply that b =^* d and c =^* d for some d E A; 

(d) => is Church-Rosser on A if, for every a^b E A, the equivalence a -O'* b 
implies that a =0* c and b c for some c G A. 

Now let B C A. We say that B is closed under =0 if, for every b E B 
and a G A, the derivation b a implies a E B. Note that if 5 C A is 
closed under =0, then (B, =>) is also a derivation system. (Hence, if this is 
the case, an expression like “O- is terminating (locally confiuent, confiuent. 
Church- Rosser) on should be understood.) 

We recall some fundamental results concerning an arbitrary derivation 
system (A, =o) which will be used in the book. 

Lemma 2.3. The derivation relation O is confluent on A if and only if it is 
Church-Rosser on A. 

Proof. First suppose that is Church-Rosser. Let a, 6 and c be elements 
of A such that a =>* b and a =»* c. Then certainly b c and, since is 
Church-Rosser, there exists a d G A with b :^* d and c d, which proves 
that =>► is confluent. 

Next assume that is confluent and let a, 6 G A be such that a b. 
That means a -O'"* 6, for some m > 0. We prove by an induction on m, that 
there exists a c G A with a O* c and 6 O* c. 

For m = 0, we have a = 6, hence the proof is complete by letting c = a(= 

6 ). 

Now let m > 0. Then there is a 6', for which a -O"* ^ 6' -O 6 and thus, 
by the induction hypothesis, there exists a c' G A with a O* c' and V O* c‘ . 
On the other hand, by the definition of -O we have either 6' O 6 or 6 O 6'. 

If 6 O' 6', then, for c = c', we get a O* c and 6 O 6' O* c. 

If the converse, i.e., 6' O 6 holds, then, since also 6' O* c' and since O is 
confluent, there exists a c G A such that c' O* c and 6 o* c. Then we get 
a o* c' O* c, too, implying that O is Church-Rosser. □ 
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The following lemma can be proved by the so-called Noetherian induction 
of which the principle is stated first. 

Principle 2.4. Let (A, be a derivation system such that =>► is terminating 
on A and let P be a predicate over A. The principle of proof by Noetherian 
induction is the following formula. 

If 

1. for every irreducible element a the formula P(a) is true and 

2. for every a £ A, if, for every h with a =»+ 6, the formula P(6) is 
true, then the formula P(a) is also true, 

then, for every a E A, the formula P(a) is true. 



□ 

The next lemma is usually referred to as Newman^s lemma. 

Lemma 2.5. Let be terminating on A. Then =>► is confluent on A if and 
only if it is locally confluent on A. 

Proof. Certainly, if => is confluent, then it is locally confluent, whether => 
is terminating or not. 

Next suppose that =>► is locally confluent and terminating. By Noetherian 
induction we prove that, for every a E A, the predicate P(a): 

If there exist 6, c E -A such that a b and a =>* c, then there exists 
a d E A, for which b =^* d and c =>* d. 

is true. 

First we prove that P{a) is true for every irreducible a £ A. In fact, if a 
is irreducible, then we have a = b and a = c and hence, P{a) is true with 
d = a{= b = c). 

Now let a E A be an arbitrary element and suppose that for every b £ A^ 
with a 6, the predicate P(6) is true. 

Assume that b and c are such that a =>-* b and a =»* c, i.e., that a b 
and a c, for some m, n > 0. 

Consider the case m = 0. Then by definition b = and for d = c, we 
have b = a =>* c = d and c =>* c = d, proving that P(a) is true. 

The case n = 0 can be handled similarly. 

Next suppose that m > 0 and.n > 0 (Fig. 2.2). Then there exist P and 
c' in A such that a =>► 6' ^ a ^ d c. Then, using that 

=> is locally confluent, we have a d! with V =>* d' and c' =>* d'. Next we 
use that, by the Noetherian induction hypothesis, P{b') is true, i.e., there 
exists a d" such that 6 =^* d" and d' =>* d". Note that for d" we also have 
d =^* d' =>►* d". Finally, again by the Noetherian induction hypothesis, P{d) 
is true, hence since d d" and d =>* c, there exists a, d E A^ with d'' d 
and c d. Then we get that P(a) is also true, since b d” d holds. 
By the principle of Noetherian induction, this proves that =>► is confluent. □ 
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Lemma 2.6. Let ^ be terminating on A. Then => is confluent on A if and 
only if every class of contains exactly one irreducible element. 

Proof. First assume to be confluent. Take an arbitrary «^*-class and let a 
be an element of that class. Then either a is irreducible or, by the terminating 
property, there exists an irreducible 6, such that a =^* b. Hence a b and 
thus b is an irreducible element in the class of a. 

Now suppose that b and c are two irreducible elements in the 0'*-class of 
a. Then b c. Since, by Lemma 2.3, => is Church-Rosser on A, we have 
b =>* d and c =>* d, for some d E A. However b and c are irreducible, hence 
6 = d and c = d and thus 6 = c. 

Next assume that each <::»*-class contains exactly one irreducible element. 
Moreover, let a,6, c G A be such that a =>* b and a =^* c. Since is 
terminating, there are irreducible elements b' and c' such that b =^* b' and 
c c'. Then also 6' c', and hence V = c' implying that => is confluent. 

□ 



Corollary 2.7. Let => be terminating and confluent on A. Then, for every 
a G A, the normal form nf{=>^ a) exists. 
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Proof. Let a E A. Then n/(=>,a) = 6, where b is the unique irreducible 
element in the O* -class of a. □ 

Corollary 2.8. Let => be terminating and confluent on A. Moreover, let 
a, 6 G A, such that a =>* 6. Then, n/(=>, a) = n/(=>, b). 

Proof. Since a =>* 6, we also have a b. Hence, by Lemma 2.6, both 
n/(=^,a) and n/(=^,6) are equal to the unique irreducible element in the 
<»*-class of a (and of 6). □ 



2.5 Semigroups, Strings and Languages 

We recall some basic notions and notations concerning semigroups. 

A semigroup is a system (M, •), where M is an arbitrary set and • : 
M X M ^ M is an associative binary operation over M meaning that, for 
all elements a,b,c E M, the equation (a • 6) • c = a • (6 • c) holds. When the 
operation of a semigroup (M, •) is clear, then we drop it and write simply M. 

Let (M, •) be an arbitrary semigroup and let 0 be an equivalence relation 
over M, Then we say that 0 is a congruence relation (or congruence for 
short), if for all elements a, a', 6, 6' G M, the property aOa' and bOV imply 
that (a • 6)0(a' • V). If 0 is a congruence on M, then M/6 = {a/O \ a G M} is 
also a semigroup, called the factor semigroup of M determined by 0, where 
we deflne a/0 • b/0 = (a • b)/0. 

Now let (M, •) and (iV, o) be two semigroups and let /i : M — ^ iV be a 
mapping. Then h is a homomorphism from (M, •) to (iV, o) if for all a, 6 G M, 
the equation h{a • b) = h(a) o h(b) holds. If h is surjective, i.e., h{M) = AT, 
then N is the homomorphic image of M. If N is the homomorphic image of 
M and ft is a bijection, then we say that M and N are isomorphic and we 
write (M, •) = (AT, o) or rather M = N. 

For every homomorphism h : M N ^ the equivalence relation 0h over M 
deflned by a0hb if and only if h{a) = A(6), for all a, 6 G M, is a congruence as 
well. It is called the kernel ofh. Moreover, if h is surjective, then M/0h = N. 

Let (M, •) be a semigroup. Then every subset H C M generates a subset 
[H] = {ai • . . . • Uyn 1 m > 1, a,- G iif, for 1 < i < m} of M . If, especially, 
[H] = M, then we say that H generates M, or alternatively, that H is a set 
of generators of M. 

Now we introduce a special semigroup called the semigroup of strings or 
words. Let A be a nonempty, flnite set of symbols, called an alphabet. A string 
or word over A is a sequence of symbols of the form a\ . . .a^i, where m > 1 
and ai, . . . , Om G A. Let A+ denote the set of all strings over A. Then A“^ is 
a semigroup with the operation concatenation of strings, which for every pair 
of strings w,z £ A“*“, yields the string wz G A“*". Obviously, the concatenation 
is associative. It is also clear that A generates A“^. 

It is a characteristic of the semigroup A"*" that for every other semigroup 
M and mapping f : A ^ M, there is a unique homomorphism h : A"*" — ► M 
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which extends /, i.e., for which f{a) = h(a), for every a E A. Hence, is 
called the free semigroup generated by A. 

Finally, we introduce the concept of a monoid. An element e of an ar- 
bitrary semigroup (M, •) is called a unit element if, for every a G -4, the 
equalities e • a = a • e = a hold. It can easily be seen that, if a unit element 
exists, then it is unique. A monoid is a semigroup with a unit element. 

Let A be an arbitrary alphabet. Let e be the empty string over A, i.e., the 
string which contains no element of A, and let A* = A"*" U {e}. Then every 
subset L of A* is called a language over A. The concept of concatenation is 
extended for languages as follows. For every pair of languages Li, L 2 Q A*, 
we put 

L 1 L 2 = {wz \ w E Li and z E L 2 }. 

It is obvious that s is a unit element with respect to concatenation be- 
cause, for every string m G A*, we have ew = we w. Hence A* is a monoid 
which is called the free monoid generated by A. 

We define the reverse of a string w E A* . 

(i) For u; = ^ we put = w. 

(ii) For a string w = av, where a E A and v G A*, we define 
The length length{w) of a string w E A* is defined as follows. 

(i) For the empty string e we define length{e) = 0. 

(ii) For a string w = av, where a E A and v G A*, we define length{w) = 
1 -f length{y). 

For w E A* and 1 < j < length{w)^ we denote by w{j) the j-ih letter of w. 

We frequently abbreviate consecutive concatenations of a string w E A* 
as follows. For an arbitrary integer n > 0, by u;”, we mean e, if n = 0, or 
if n > 1. 

Let UyV E A*. We say that u is a prefix of v if there is a string x such 
that V = ux. 

For two strings u and u;, we say that u is a substring of w (or u occurs 
in w)j if there are strings x and y such that w = xuy. In particular u occurs 
in It;, if u is a prefix of w. In general, the position of u in m is not unique. 
When we refer to a particular occurrence of u in 1 /; we always mean that the 
surroundings x and y are also given such that w = xuy. 

Let now u,t;, and w be strings such that u and v occur in w. Take a 
particular occurrence of both u and v, which are defined by it; = xuy and 
w = x^vy' . We say that this particular occurrence of u and v do not overlap 
in w if either 

• x'v is a prefix of x^ or 

• XU is a prefix of x'. 
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We note that “do not overlap” is a symmetric relation. 

We now define the concept of the siring subsiiiuiion: Therefore, let w be 
a string, / be a finite index set, and let U = {i/,* | i G /} be a set of nonempty 
strings. Let U be such that 

• for every i^j G /, if U{ = Uj^ then i = j (i.e., the u,’s are pairwise disjoint), 
and 

• for every i,j G /, if both u,* and Uj occur in w, then any two occurrences 
of Ui and Uj do not overlap in w. 

Moreover, let V = {i;,- 1 i G /} be a set of strings. Then we denote by 

w[ui Vi]i G /] 

the string which is obtained from w by replacing every occurrence of u* by 
Vi for every i G /. (We note that the denotation w[ui ^ G /] does not 
necessarily imply that, for every i G /, is a substring of w,) 

If U = {ui, . . . , Un} and V = {i^i, . . . , Vn}, then w[ui i— v,-; z G /] is also 
denoted by w[ui vi, . . . , tin ^ v„] or by v,-; U{ G U]. 



2.6 String Rewrite Systems 

Let A be an arbitrary alphabet. A string rewrite system over A is a finite set 
S C A* X A*. Elements of S are called (rewrite) rules and are written in the 
form ti — ► V, rather than (u, v). 

Any string rewrite system S determines a derivation system (A*, => 5 ), 
where the derivation relation is defined in the following way. For w,z E 
A* , we have w =^5 z if and only if 

• there is a rule ti — ^ t; in 5, 

• there are x and y in A* , 

such that w = xuy and z = xvy. If this is the case, then we say that the rule 
u V is applicable to w. 

We denote the set of irreducible elements of A"^ with respect to =>5 
by IRR{AyS). Moreover, we say that S is terminating (locally confiuent, 
confluent. Church- Rosser), if =>s is so. 

Let 5 be a string rewrite system over A. Then is a congruence over 
the free semigroup A^ and hence A"*"/ is a semigroup too, called the 
semigroup presented by S. Alternatively, S presents the semigroup A“^/ 

A semigroup M is said to be finitely presentable if there is an alphabet A 
and a string rewrite system S over A such that M = A"*"/ 
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2,7 Grammars 

A grammar is a 4-tuple G = {N, i7, P, 5), where 

(1) N is an alphabet, called the nonterminal alphabet, 

(2) U is another alphabet, disjoint from N, called the terminal alphabet, 

(3) P, called the set of rules, is a finite subset of the set 

{N U E)* N{NU E)* x{NU E)* , 

i.e., it is a finite set of elements of the form a ^ /3, where ct,P E {NKJE)* , 
such that a contains at least one nonterminal. 

(4) 5 is a designated element of N called the start symbol. 

The grammar G induces a derivation system {{N U i7)*,=^c?), where the 
derivation relation or shortly is defined in the following manner. For 
any j,6 E (N \J E)* , we define 7 => 5 if and only if 

• there is a rule a ^ /? in P, and 

• there are 71 and 72 in {N U E)* such that 7 = 7ia72 and 6 = 71/^72 . 

Then G generates a language which is defined by 

L{G) = {weE*\S=>*o M- 

In this book we are interested in language classes generated by some 
restricted grammars. First, we define context-free grammars. A grammar is 
context-free if every rule in P has the form A — ► a, where A E N. A language 
L C E* is called context-free if it can be defined by a context-free grammar, 
i.e., there is a context-free grammar G such that L = L{G). The class of all 
context-free languages is denoted by £c/- 

Further, a context-free grammar is called linear if, for every rule A — ► 
a, the string a contains at most one symbol from N. In other words a E 
(E^^NE*) U E* holds. The class of linear languages is defined in an obvious 
way and it is denoted by Cun • 

The third language class we will consider is defined as follows. A grammar 
G as above is called regular if every rule in P has either the form A — ^ wB 
or the form A w, where A,B E N and w E E*. The class of languages 
generated by regular grammars is called the class of regular languages and it 
is denoted by Creg- 
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In this section we collect definitions, notions, and notations concerning trees, 
the principles of definition and proof by structural induction, properties of 
trees, and tree transformations. 
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Trees 

A ranked set is a tuple (f2, rank) where 12 is a (possibly infinite) set and 
rank : 12 — ► N is a mapping called the rank function. If u; G f2 is such that 
rank{u) = Ar, then we say that the rank of a; is Ar or that lo has rank k. For 
every A: > 0, we define {2^^^ = {a; G ^2\rank{u)) = A:}. Thus, the fact that 
a; has rank k can also be expressed by writing u G Hence we can drop 
rank from the specification (f2, rank) by just saying that 12 is a ranked set. 
Moreover, we also write to denote the fact that u has rank Ar, i.e., that 
u G Sometimes we also denote the rank function by rankp. 

In this book we will need an infinite supply of ranked symbols. 

Hence we now take and fix an infinite ranked set Q having the prop- 
erty that, for every Ar > 0, is also infinite. 

By a ranked alphabet we mean a finite subset E of X2, where ranks are 
taken over from f2, i.e., we put = i7 fl f2(*) for every Ar > 0. With this 
definition of a ranked alphabet we state that the rank of a symbol cr is the 
same in all ranked alphabets to which it belongs, i.e., the one taken over from 
f2. 

A ranked alphabet E is called unary if E = E^^\ i.e., every a £ E has 
rank 1. Moreover, E is monadic if E^^^ contains exactly one element and, for 
every n > 2, the set E^^^ is empty. 

Let 17 be a ranked alphabet and let A be a set disjoint with E. The set of 
(finite f labeled, and ordered) trees over E indexed by A, denoted by T 27 (A), 
is the smallest subset T of (i7 U A U {(, )} U {, })*, such that 

(i) ACT and 

(ii) if <7 G i7(*) with Ar > 0, and ti , . . . , tjb G T, then (r{ti , . . . , t*) G T. 

In the case Ar = 0, we denote cr{) by a. Moreover, Ti7(0) is denoted by Tjj. 
It should be clear that Tj; = 0 if and only if E^^^ = 0. Since we are not 
interested in this particular case, we assume that E^^^ / 0 for every ranked 
alphabet appearing in this book. 

A set L C Ts is called a tree language. 

For trees over a monadic ranked alphabet, we introduce a simpler deno- 
tation as follows. Let i7 be a monadic ranked alphabet and t £ T^. Then 
either t £ E^^^ or t = <7i(<72(. . . . .)), where n > 1, (Ti, . . . , cr„ G i7^^) 

and (T £ E^^\ In the latter case we also write t as <7 i(T 2 . . .(Tn((7), or as 
(71(72 . . . (7n(7 for the sake of simplicity. 

We will need sets of variable, symbols to specify different kinds of tree 
transducers. Such a set of variable symbols is X = {a^i, a? 2 , • . .} and later 
we will specify some of its isomorphic copies. The elements of X will be 
called variables. For every A: > 0, the set of the first k variables in X will be 
denoted by Xk, hence Xk = {a?i, . . . , a?jb}. Note that Xq = 0. These notations 
for variables will be kept fixed throughout the book. The variables will occur 
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in trees, so we will frequently consider trees from sets of the form Ts{X)f 
Ts{Xk)i etc., where i7 is a ranked alphabet. 

If Q is a unary ranked alphabet and A is a set of symbols, then Q{A) 
denotes the set {g(a) | g G Q, a G A}. 

Since trees are also strings, the concept of string substitution defined in 
Sect. 2.5 applies to trees as well. This operation will frequently be used in the 
special case when variables in a tree will be substituted by other trees. For 
example, let t G Tjj{Xje), for some ranked alphabet E and fc > 0. Moreover, 
let ti, . . . , tjb also be trees over a (possibly different) ranked alphabet indexed 
by a set. Then, according to the concept of string substitution defined in Sect. 
2.5, the notation t[xi ^ ti, . . . , xjb tk] means the tree which is obtained 
from i by substituting, for every I <i < ky the tree U for each occurrence of 
Xi in t. 

Now let i7 be a ranked alphabet. Moreover, let Z = {zi , Z 2 , • • •} be another 
set of variables, disjoint with X, and let > 0. The set of (i7, k)-contextSy 
denoted by Cjj^ky is the set of trees t over E indexed by Zk such that for every 
I <i < ky the symbol z,- occurs exactly once in t. Note that a {Ey fe)-context 
t can also be considered as a string over the alphabet Z'U {zi, . . . , z*} U {, } U 
{(,)}• 

For contexts, we use a simpler notation for the substitution. Namely, let 
t be a {Ey Ar)-context and let ti, . . . , tjb be trees. Then we abbreviate t[z\ 4 — 
<-<*•] to i*]. 

structural Induction 

Next we recall the principle of structural induction which is a straightforward 
generalization of the (usual) principle of induction on the set N of natural 
numbers. Let us briefly recall the principles of definition by induction and 
proof by induction. 

Principle 2.9. Let C be a set. By inductiony a mapping / : N — ► C can be 
defined. The principle of definition by induction is: 

If 

IB: /(I) is defined and 

IS: for every n > 1 , if f{n) is defined, then f{n + 1) is defined, 

then for every n > 1 , /(n) is defined. 



□ 

The abbreviations IB and IS stand for induction base and induction step, 
respectively. 

Principle 2.10. By inductiony a predicate P over N can be proved. The 
principle of proof by induction is: 

If 

IB: the formula P(l) is true and 
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IS: for every n > 1, if the formula P{n) is true, then the formula 
P(n + 1) is true, 

then for every n > 1, the formula P{n) is true. 



□ 

Now the generalization means replacing the initial algebra which is gener- 
ated by the particular ranked alphabet by the initial algebra 

which is generated by an arbitrary ranked alphabet E. 

Principle 2.11. Let 17 be a ranked alphabet and let A, C be sets. By struc- 
tural induction, a function 



f:Ts{A)^C 



can be defined. 

The principle of definition by structural induction is: 

If 

IB: for every a £ AU the value /(a) is defined and 
IS: for every k > I, a e and si, . . . , sjb E Tjj(A), if the values 
/(si ),..., /( sa;) are defined, then the value f{a{si, . , . ,Sk)) is 
defined, 

then for every s E Ts{A), the value f{s) is defined. 



□ 

Principle 2.12. Let 27 be a ranked alphabet and A be a set. By structural 
induction, a predicate P over Ts{A) can be proved. 

The principle of proof by structural induction is: 

If 

IB: for every a £ A U the formula P{a) is true and 
IS: for every fc > 1, cr E 27^*\ and si , . . . , Sjb E Ts(A), if the formulas 
P(si), . . . , P{sk) are true, then the formula P((t(si, . . . , Sk)) is 
true, 

then for every s E Ts{A), the formula P{s) is true. 



□ 

We note that the principle of proof by structural induction is a special 
case of principle of proof by Noetherian induction (Principle 2.4), where the 
basic set is Ts{A) and the derivation relation is the following: for a pair of 
trees t,f E Ts{A) we define t => f if and only if f is a direct subtree of t. 

Properties of Trees 

Let 27 be a ranked alphabet and A be an arbitrary set. By structural induc- 
tion, we define four characteristics of a tree t E Tjj{A): the height oft, the size 
oft, the set of occurrences oft, and the set of subtrees of t. Formally these 
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can be described with the functions height : T^{A) — ► N, size : Ts{A) N, 
occ : Ts{A) — ► and sub : Ts{A) — ► V{Ts{A))y respectively, as follows. 

IB: lUeS^^^UA, then 

• heighi{t) = 1, 

• size{t) = 1, 

• occ{t) = {e}y 

• sub{t) = {t}. 

IS: If ^ = (t(si , . . . , sjb) where a G k >1 and si, . . . , sjb G Tjj{A), then 

• height{t) = 1 -|- max {height {si) | I <i < k}, 

• size{t) = 1 + Ei<*<ib size{si), 

• occ(t) = {e} U {u; I ti; = 2v, 1 < 2 < Ar, t; G occ(s,)}, 

• su6(<) = {4uUi<.<jb««K«<)- 

We note that this definition of height is different from that used in Chapter 
1. 

Also we define the label at an occurrence of a tree t G Ts{A). More 
precisely, the mapping 

label : {(t, w)\t E Ts{A)^ w G occ{t)} SU A 

is defined by structural induction for every t G T^{A) and w G occ(t). 

IB: lit E U A (and thus w = e), then /a6e/(t, w) = t. 

IS: If ^ = cr(si, . . . , Sib) for some cr G with Ar > 1 and si, . . . , sjb G Ts{A), 
and 

• if u; = 6:, then label{t^ w) = a, otherwise 

• if It; = iv for some 1 < 2 < A:, then /a6e/(t, w) = label{ti^v). 

We will also need the concept of the set of variables occurring in a tree 
t G Ts{X). This is defined with the function Varx : Tj]{X) — ► V{X) in the 
following way. 

IB; Ift € UX, then if t G Varx{t) = 0, otherwise Varx{t) = {<}. 
IS: If t = a{ti , . . . , tjfc), for some a G 1^'°^ with fc > 1 and <i , . . . , t* G Ts{X^, 
then Varx{t) = Ui<j<t y<^rx{ti)- 

The following lemma states a basic relation between the height and the 
size of a tree over a ranked alphabet. 

Lemma 2.13. Let 17 be a ranked alphabet. Then there is an integer c > 1 
such that, for every s G size{s) < 

Proof. In fact, we prove somewhat more than we have stated. Namely, we 
show that size{s) < c^^* 9 ht(s) if ^ and that size{s) < c^^* 9 ht(s) if 

Let us define c by c = max{ k \ ^ 0 } + 1. 

If i7 = then c = 1 and, for every s height{s) = size{s) = 1. 

Then our statement obviouslv holds for everv s. 
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Assume now that S ^ (Note that in this case c > 1.) We prove by 
structural induction on s. 

Let s G Then, since size{s) = height{s) = 1 and c > 1, we have 
size{s) < c^^isht(s) 

Let s = (t(si, . . .,sjb), for some fc > 1, (t G and si, . . .,sjb G Tjj. 
Moreover, let io with 1 < io < k he such that heighi{si^) > height{si)^ for 
every 1 <i<k. Then we can compute as follows. 





size{s) 




= 


1 + E*=i size{si) 




< 


^ ^height(ai) 


(by the induction hypothesis and by Ar > 1) 


< 


. ^heighi(siQ) 


(by the definition of io) 


< 


^ . ^height(8i^) 


(by ok) 


— 


^height(siQ)-\-l 




= 


^height(s) 


(by the definition of height{s)) 



This proves our lemma. □ 

Tree Transformations 

The concept of the tree transformation is central to this book. A tree irans- 
formation is a mapping r \Ts for some ranked alphabets E and A. 

The class of all identity tree transformations, denoted by ID, is the class of 
all relations Id{Tjj) for some ranked alphabet E, 

The following lemma expresses the obvious property of tree transforma- 
tions that every tree transformation can be written as the composition of 
itself and a suitable identical tree transformation. Since this fact will be used 
in the book, we state it explicitly. 

Lemma 2.14. Let C, D, and E be classes of tree transformations. If C C D 
and ID C E, then C C Do E and C C E o D. 

Proof. Assume that the conditions of our lemma hold. Take a tree transfor- 
mation T :Ts —^Ta from C. Then also r E D. Moreover, the identical tree 
transformations l = Id{Ts) and = Jd(T^) are in E and both r = r o l' 
and T = tor hold. Therefore, both t £ D o E and r £ E o D, □ 

The above lemma will be frequently used in the special case where C = D. 
Then it expresses that both C C C o E and C C E oC providing ID C E. 
Moreover, if ID C C, then C CC oC also. 




3. Top-Down Tree Transducers 



In this chapter we introduce the concept of a top-down tree transducer. It is 
a formal model for syntax-directed semantics and it is created by abstraction 
from the model called attribute grammar with synthesized attributes only. 
The structure of the chapter is as follows. 

1. Basic definitions 

2. Induced tree transformation 

3. Characterization of top-down tree transformations 

4. Height property 

5. Subclasses of TOP 

6. Composition and decomposition results 

7. Composition semigroup generated by TOP, l-TOP, and HOM 

8. Bibliographic notes 



3.1 Basic Definitions 

We start by defining a set, called the set of right-hand sides, of which the 
elements may be right-hand sides of rules of a top-down tree transducer. 

Definition 3.1. Let Q be a unary ranked alphabet, ^ be a ranked alphabet 
disjoint with Q, and fc > 0 be an integer. The set RHS{Q, A, k) of right-hand 
sides over Q,A, and k is the smallest subset RHS of Tg^Ai^k) satisfying 
the following two conditions: 

(i) For every p EQ and I < i < k, the term p{xi) is in RHS. 

(ii) For every 6 G A^^^ with / > 0 and , . . . , 6 ^ RHS, the term ^ (^i , . . . , 

is in RHS. □ 

We note that, for every k > 0, Ta Q RHS{Q, A, k) and that in particular 
Ta = RHS{Q,A,0). 



We now define the concept of a top-down tree transducer. 
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Definition 3.2. A top-down tree transducer is a tuple T = (Q, E, A, qo, iJ), 
where 

• Q is a unary ranked alphabet, called the set of states^ 

• E and A are ranked alphabets with Q C\{E U A) = 0, called the input 
ranked alphabet and output ranked alphabet, respectively, 

• go is a designated element of Q, called the initial state, 

• iZ is a finite set of rules such that, for every q £ Q and a G E^^^ with 
A: > 0, there is exactly one rule of the form 

9(<7-(a;i,...,a:*)) 

in R with ^ G RHS{Q, A, k). □ 

A rule as above is called the (g, (r)-rule or the g rule for a. Moreover, ^ is 
called the right-hand side of the rule and is also specified as rhs{q,o’). Thus 
the expression ^ = rhs{q,a) reads as ^ is the right-hand side of the {q^cry 
rule. Obviously, not every element of RHS{Q, A, k) is the right-hand side of 
some rule in R. 

A top-down tree transducer can transform input trees into output trees 
and hence it realizes or induces a tree transformation. (The exact definition of 
the tree transformation induced by T will be defined later, after the necessary 
groundwork.) Trees over E, i.e., the elements of Tjj are called input trees to 
T. An input tree will be transformed into an output tree, which is an element 
of Ta • Thus output trees are those trees over A which can be obtained by 
the tree transformation induced by T from an input tree. 

We mention that in the theory of top-down tree transducers, a more 
general terminology is used. Among others there is no restriction on the 
number of (g, (r)-rules in R. Moreover, if a top-down tree transducer is such 
that, for every g G Q and a £ E, there is at least one (g, (r)-rule in R, then it 
is called total. On the other hand, if, for every q E Q and a £ E, there is at 
most one (g, u')-rule in R, then it is called deterministic. Thus, the top-down 
tree transducer we have defined is both total and deterministic. In fact, this 
is sufficient as was shown by the discussion in Chap. 1. 

Next we give an example for a top-down tree transducer. 

Example 3.3. Let T' = (Q, E, A, go, R) be such that 

• Q = {go,9i,?2}, 

• 27= {a(°), 7 W} and A = 

• R is the set of the following rules 

1) qo(a) ->■ a, 

2) 9i(a)-^A, 

3) q2{a)-*02, 

4) 9o(7(®i)) %i(a:i),92(a;i)) 

5) 91 ( 7 ( 2 : 1 )) -^ 7(91 (ici)) 

6) 92(7(2:1)) -^7(92(2:1)). a 
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3.2 Induced Tree Transformation 

The discussion in this section concerns an arbitrary, but fixed top-down tree 
transducer 

T={Q,E,A,qo,R). 

We define the tree transformation induced by the top-down tree trans- 
ducer T according to the following schedule. First we define the derivation 
relation induced by T. Then we define the set of sentential forms of T and 
show that the derivation relation is terminating and confluent on sentential 
forms. Finally, the tree transformation induced by T will be defined as a 
mapping that associates to every input tree s £Ts the normal form of the 
tree qo{s) with respect to the derivation relation. 

Hence we start by defining the derivation relation induced by T. 
Before inspecting the definition, the reader is advised to consult Sect. 2.8 
for the definition of a context. Figure 3.1 illustrates the definition of the 
derivation relation. 

Definition 3.4. The derivation relation induced by T is the binary relation 
=>T over the set Tqulua such that, for every E Tqusua we define 
(p =>T if 

• there is a context /? G Cqusua,!^ 

• there is a state g G Q, 

• there is a fe > 0 and a G 

• there are trees si , . . . , sjb G Tj; , 

such that (p = P[q{(T{si , . . . , Sk))] and rp = <— si , . . . , xjt ^ sjb]], where 

^=zrhs{q^a). □ 

In the above case we say that the rule q{o’{xi , . . . , Xjb)) — ^ ^ is applicable to 
(p. It can be seen that is a derivation system in the sense of 

Section 2.4. 

The necessary action to obtain xp from i.e., finding the context /? and 
the subtree 9(<r(si, . . .,sa;)) to be substituted as well as substituting ♦— 
^ ^ for 9(<r(si, . . is called a derivation step. We now show 

an example for a single derivation step and a sequence of such steps. 

Example 3.5, Let us consider again the top-down tree transducer of Example 
3.3. Let p = 5(91(7(0)), 7(g2(ct)))- Then, for the context 0 = 5(2:1,7(92(0))) 
and the tree s\ = a, we have p = /^[9i(7(^i))]* Moreover, rhs(gi,7) = 
7(91 (xi)). Then the relation p holds , where 

V> = /?[»-/»s(9i.t)[^Ci si]] 

= ^(2i.7(92(a)))[7(9i(®i))[®i <-«]] 

= ^(zi.7(?2(a)))[7(9i(a))] 

= ^(7(9i(«)).7(?2(a)))- 
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Fig. 3.1. A derivation step induced by T 



Next we show a sequence of derivation steps for the top-down tree transducer 
appearing in Example 3.3 (Fig. 3.2). We can write the following. 



9o(T^(a)) =^T' 



%i(7(a)),g2(T(«))) 

^(7(gi(a)),?2(T(«))) 

<5(7(?i(a)),7(g2(a))) 

%(A).7(?2(a))) 

^(7(/?i),7(^2)) 



(by rule 4) 
with P = z\ 

(by rule 5) 

with/? = 5(zi,g2(7(of))) 

(by rule 6) 

with /? = 6(7(51 (a)), zi) 

(by rule 2) 

with P = S{y{zi)),^{q2{a))) 
(by rule 3) 

with /? = 6(7 (A),t(-2^i)) 



This example shows a particular property of a top-down tree transducer: it 
can make copies of a subtree of the input tree (viz. the two copies of 7(0)) 
and then process these copies with different states (viz. the states 51 and 52) 
and produce different subtrees (viz. j{Pi) and 7(^32)) of the output tree. □ 





3.2 Induced Tree Transformation 



65 




In order to define the tree transformation induced by T we do not need 
to consider over the whole set Tguru^i- A smaller set which is called the 
set of sentential forms of T will be sufficient. The set of sentential forms can 
be thought of as the set of trees that can appear in derivations concerning 
the top-down tree transducer T. 

Definition 3.6. The set of sentential forms ofTy denoted by 5F(T), is the 
smallest subset SF of Tquekja for which the following two conditions hold. 

(i) For every q £Q and s £Te, the tree q{s) is in SF. 

(ii) For every 6 G with / > 0 and , . . . , y?/ G 5F, the tree 6{(p \ , . . . , y?/) 
is in SF. 

Moreover, a tree of the form go(«)) where s G Tj;, is called initial sentential 
form. We note that SF{T) C Tqusua- ° 

The following lemma is needed for technical reasons. It expresses the close 
connection between the set of possible right-hand sides and the set of sen- 
tential forms. Namely, each sentential form can be obtained by substituting 
input trees into a suitable right-hand side and vice versa; each tree obtained 
by the substitution described above is a sentential form. 

Lemma 3.7. The following two statements hold for the top-down tree trans- 
ducer T. 

(a) For every ^ G RHS{Qj A, k) with k >0 and si , . . . , s* G Ti; it holds that 
^[xi ^ Sly...yXk ^ Sk]e SF{T). 

(b) For every (p G 5F(T), there exist fc > 0, ^ G RHS{QyAyk)y and 

«i , • • • , G Ti 7 such that (p = ([xi ^ si, . . . , ^ Sk]. 
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Proof, (a) The proof is by structural induction on 

(i) If ^ = g(^i) for some 1 < i < fc, then ^[x\ ^ si, . . . , xjb <— Sjb] = 9(^») 
and hence ^[xi ^ si, . . . ,Xk Sk] E SF{T). 

(ii) Now let ^ = ^(^i, . . . for some / > 0, 5 G and . . . ,6 G 

RHS{Q,A,k), Then we have ^[xi ^ ^ s*] = ^ 

si, . . . , xjb ^ Sk], . . . ,6[^i ^ «i, . . . , xjfc <«— sjb]). Since, by induction hypoth- 
esis, for every 1 < i < /, the term ^j[xi ^ si, . . . , xjb sjb] is in SF{T) we 
find that ^[xi «i, . . . , Xjfc sjb] is also in SF{T), 

(b) Structural induction on (p. 

(i) Let (p = q{s) for some q E Q and s ETs- Then letting = 1, ^ = qi^i), 
and si = 5 we have p = ^[xi ^ si]. 

(ii) Suppose now that p = 6{pi^ . . . ^pi) for some 6 G with / > 0 

and pi,...,pi G SF{T). By induction hypothesis, for every 1 < i < /, 

there exist ki > 0, G RHS(QyA,ki)^ and sn,..,,Siki E Ts such that 
P>i = ^ 5*1, • • . , Xki ^ SikiY Let, for every 1 < i < /, 

k = ki -h ... -h k/j and si = sn, . . . , = ^iki • Then, for every 1 < z < /, it 

holds that 



^j[xi < S,*i, . . . , Xki ^ ^*A;J — ^*[^1 ^ } • > • } Xk ^ S^]. 

Now define ( = 6((Y . . . and, let k and Si, . . . , sa: be as above. Then we 
can compute as follows 

^[Xi ^ a?Ar ^ Sjfe] 

= ^Si,...,Xik ^ Sib],...,^/[iPi <-si,...,Xk ^Sk]) 

= ^(6[2^1 ^ 511, • • • ^ SlJfci], • • ^ ^Ily-iXk, Slki]) 

= S(pu...,pi) 



In the following lemma we prove that the set of sentential forms is closed 
with respect to the derivation relation, i.e., that (5F(T), ^t) is also a deriva- 
tion system. This will also prove that, starting from a sentential form, we can 
always obtain sentential forms by applying derivation steps. In particular, this 
will be true for initial sentential forms. 

Lemma 3.8. SF(T) is closed under =^t- 

Proof. We show that, for every sentential form p G SF(T) and term rp E 
TQusuAi if ^ =>T i’, then xp E SF{T). The proof is performed by structural 
induction on p, 

(i) Let p = q{s)y where q E Q and s = <r(si , . . . , 5jb) G Tj;, for some 
(T G and si, . . . , Sjb ETs- Then, letting p = c[q{a{si ,, . . , Sk))], where c 
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is a context in Cguru^.i, we must have c = zi. Hence, by the definition of 

=>T, 

V’ = c[^[®l <- St]] = ^[*1 ^Si,...,Xlfe <-Sjfc], 

where ( = rhs(g, a) and thus, by Lemma 3.7, rp G SF{T). 

(ii) Now let (p = 6((pi, . . . ,(pi), for some S G with / > 0 and 
G SF{T), Then, by the definition of (p =^t ^ means that 
(pi =>T V’*, for some 1 < i < / and rpi G Tquuua- By induction hypothesis, 
G SF{T). On the other hand, 

tp = 6(^1, . . . , (pi-i,rpi, (pi^i, . . . , pi), 

and hence 'tp G SF{T). □ 

In the following we are going to prove that the derivation relation is 
terminating and confiuent on SF{T), We start by proving local confiuency; 
then confiuency follows from the termination and Lemma 2.5. 

Lemma 3.9. The derivation relation is locally confiuent on SF{T). 

Proof. Let and p 2 be sentential forms of T such that p Pi and 

P =>T P 2 - We show that there exists a sentential form tp G SF{T) such that 
(pi tp and p 2 V'* 

By Def. 3.4, for i = {1, 2}, 

• there is a context c» = Cguru^.i, 

• there is a state qi G Q 

• there is a A:,* > 0 and a (Ti G 

• there are trees s*i , . . . , Siki G Tjj , 

such that 

<p = Ci[9i(o-i(sii,...,SitJ)] = C2[?2(0-2(s21,...,S2ibj))], 

V’l = ci[^i[a!i <— sii, . . .,xtj sifc,]], 

V>2 = C2[^2[®1 <— «21 , . . . , a:*, <— S2ib3]], 

where = rhs{qi, (t,), for i = {1, 2}. 

Consider the occurrences W{ leading from the root of c,- to the only z\ in 
c,'. We make two observations about wi and W 2 . 

First we observe that if u;i = u; 2 , then c\ = C 2 , gi = ^ 2 , Ari = Ar 2 , 

and sii = S 21 , • • • , siJki = S 2 Jk 2 - Consequently = ^ 2 , hence pi = p 2 and we 
have ^p = pi{= P 2 )- 

Next we show that neither W 2 is a proper prefix of wi, nor wi is a proper 
prefix of u; 2 ; then we say that w\ and W 2 are disjoint occurrences. Suppose 
on the contrary that W 2 is a proper prefix of wi . Then there would exist a 
context c G Cgui;u^,i such that c ^ z\ and ci = C 2 [c]. Moreover, from 
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V? = Ci[gi(eri(sii,...,sitj)] 

= C2[c[gi(<Ti(sii,...,SitJ)]] 

= C 2 [g 2 (o‘ 2 (« 21 , • • •iS 2 fca))] 

we would find that 

c[gi(cTi(Sii, . . = q2{(T2{s2l, . . •,52*2))* 

This would yield that the root of c is also q 2 which contradicts the supposition 
that (p 6 SF{T). In a similar way, we can show that w\ cannot be a proper 
prefix of W 2 either. 

Hence we may assume that w\ and W 2 are disjoint occurrences. Then, since 
both w\ and W 2 are also occurrences of there is a context c G C'gui7u^,2 
such that 

(f = c[z\ ^ . . .,SiJbJ),Z2 ^ 92(^^2(521, • 521:3))], 

(pi = c[2Tl ^ Sil, . . 5iJbJ,Z2 ^ 92(^^2(521, • • • ,521:3))], 

(f 2 = c{zi ^ 9l(^l(«ll, • • •,«ll:i)),^2 <-6[a^l ^«21,...,iPl:3 ^ ^21:3]]* 

Then it is easy to see that, for 

= c[2Ti ^ 511,. . ^ 5iJkJ,2:2 ^ ^2[xi ^ 521, . . • , ^ ^ 21 : 2 ]], 

we have (pi =^t V’ and <p 2 ^ 

In order to prove statements concerning trees, like the terminating prop- 
erty of =>-T on 5F(T), we need another kind of proof technique. This is called 
proof by simultaneous induction, the principle of which will be presented now. 
As a preparation, we define two families of predicates, which will be required 
to form the statements precisely. 



Definition 3.10. Let 17 be a ranked alphabet. Let A be a set and, for every 
a E A, let Ka ' Ts {true, false} be a predicate over T^. Moreover, for 
every A: > 0, let Bk be a set, and, for every b £ Bk, let Lh,k • 

{true, false} be a predicate. Let 

K = {Ka\aeA} 

and let 

L = [Lb^k \ be Bk for some k > 0}. 

We say that K holds if, for every a G A and s E Tjj, Ka{s) = true. 
Similarly, we say that L holds if, for every k > 0,b E Bk, and 5i , . . . , ^Tz;, 

Lb,k{{si,...,Sk))=true, □ 

Thus simultaneous induction is a technique which can prove simultane- 
ously that families of predicates like K and L hold. This technique generalizes 
the proof by structural induction (see Principle 2.12). The principle of simul- 
taneous induction is as follows. 
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Principle 3.11. Let K and L be two families of predicates as in Def. 3.10. 
The principle of proof by simultaneous induction is: 

If 

IB: for every 6 G Bo) the formula Lft,o(( )) = true and 
ISl: for every a G A, fe > 0, and «i,...,sjb G Ts, if for every 
6 G Bjb, L6,jk((si, . . .,5jb)) = true, then, for every a G 
Ka{o’{si, . . = true and 

IS2: for every k > 1 and si,. ,.,Sk ETs, if for every ai, . . . , a* G A, 
Kai{si) = true, . . ,,Kaj,{sk) = true, then, for every b E Bk^ 

Lft, sjb)) = true, 
then K and L hold. 



□ 

The abbreviations IB, ISl, and IS2 stand for induction base, induction step 
1 and induction step 2, respectively. 

Note 3,12. In concrete applications of Principle 3.11 throughout the book, 
the sets Bk,k > 0 will form a hierarchy, i.e., Bk C Bjk+i for every k > 0 
(see Sect. 2.2). Hence, for any A: > 0 and 6 G Bjb, it also holds that, for every 
I > k, b E Bi. Moreover, for every k >0 and b E Bk, the predicates Lb^k will 
have the following property: for every I > k and sequence s\, . . .,si of trees 
in Ts, 

Lb,k({si, . . .,Sjb)) = Lbj{{si, . ..,«/)). 

This, and the fact that the arity of a predicate will always be clear from 
the context, namely from the specification of its arguments, encourages us to 
drop the index k from Lb^k in concrete applications of Principle 3.11. Thereby 
our formula becomes more readable. □ 

We can now prove by simultaneous induction that the derivation relation 
of a top-down tree transducer is terminating on the set of sentential forms. 
In the proof we will show how Principle 3.11 is applied to this concrete case. 
However, later, we shall leave such details to the reader. 

Lemma 3.13. The derivation relation is terminating on SF(T). 

Proof. We prove that, for every sentential form (p, there exists an integer g,p 
such that, for every derivation (p = (p^,(p\, . . . , (pm starting from (p, we have 
m < g^p. 

To see this, we prove by simultaneous induction that K and L hold, where 
K and L are defined in the following way. Let 

• A = Q and 

• for every q E Q, let Kq be the predicate over Ts defined such that, for 
every s G Tj 7 , Kq{s) = true, if and only if there is an integer gq^a such that 
the derivations starting from q{s) are not longer than 

• Let K = {K,\qEQ] . 
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Moreover, 

• for every A: > 0, let Bk = RHS{Q, A, k) and 

• for every ^ G RHS{Q^A^k)^ let be the predicate over (Tr)* defined 

such that, for every cj = (si, . . .,sjb) G = true if and only 

if there is an integer such that the derivations starting from 

^ Sk] are not longer than 

• Finally, let L = {L^ [ ^ G RHS{Q, A, kj for some fc > 0}. 

First we prove IB. As we noted after Def. 3.1, RHS{Q^A,0) = and 
thus ^ G Moreover a; = (). Then = 0 is suitable, since the only 
derivation starting from ^ is the sequence 

Next we prove ISl. Let s = <r(si, . . . , sjb), for some a G and 

E Ti 7 , and let lj = (^i, . . . , sjb)- Moreover, let be 

a derivation sequence starting from q{s). We observe that, by the definition 
of ^T, (pi = ^ si^,..,Xk ^ Sjb], where ^ is the right-hand side of 

the {q, cr) rule in R. On the other hand, ^i, . . . , ipm is a derivation of length 
m— 1 starting from (pi. Thus, by the induction hypothesis on L, there exists 
an integer g^^u>i such that m — 1 < g^^^i hence, for gq^s = 1 + g^,LUi we have 
m<gq^s- 

The IS2 is proved by induction on the structure of ^ . 

(i) Let ^ = g(ar»), for some q eQ and 1 < z < fc. Then ^[x\ ^ si , . . . , a:* ^ 
^k] = Q{si) and thus every derivation starting from ^ [a?i ^ si , . . . , arjb ^ s*] in 
fact starts from g(5,). Hence, by the induction hypothesis on g^^u; = 9q,si 
is suitable for our purpose. 

(ii) Now let ^ = 5(^i, . . . ,^/), for some 6 G A^^\ I > 0, and 6, ... ,6 E 

RHS(QyA,k). Then we have si,...,a?jb ^ Sk] = ^ 

si,..,,Xk ^ Sk]y. . ^ Si,...,xjfc ^ Sk]) and, by induction hypoth- 

esis on for every !<*</, there exists an integer such that the 
derivations starting from (i[xi ^ si , . . . , xjb ^ Sk] are not longer than g^i^u;- 
On the other hand, we note that, for every derivation 

^[xi Si,.,,,Xk ^ Sk] 

— 5(^i[a?i ^ si,.,.,Xk ^ Sjb], . . .,^/[a?i si, . . . , a?jb ^ sjb]) 

starting from ^[xi ^ si^.,.,Xk ^ «jb], and 0 < z < m, we have pi = 
^(Ctij • • • ) Cii)i for some sentential forms C*D • • • ? Cu ^ SF{T). Moreover, by 
the definition of > for each 0 < z < m — 1 , there is exactly one I < j < I 
such that Cij =»T C(*+i)i siJrd Qjt = C(i+i)iS for every other f with 1 < / ^ 

j < 1. Therefore g^^u; = 5Z!=i suitable. (Note, that in the particular 

case / = 0, we define g^^^f — 0). 

We now finish the proof of the lemma. Let (p be an arbitrary sentential 
form. Then, by Lemma 3.7, there exist fc > 0, ^ G RHS{Q,A,k)y and a 
sequence u = (si,...,Sk) E (Ts)^ with p = ^[xi <r- si,...,Xk ^ s*]. We 
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have proved that L holds, hence any derivation starting from (p is not longer 
than hence we have Qtp = ^ 

We observe that, in the proof by simultaneous induction that K and L 
hold, the proof of the items IB and IS2 can usually be performed in one 
step. This is because IB can be incorporated into IS2 by extending IS2 to 
the case Ar = 0. Actually, IS2 in the case fc = 0 means that, for every b E Bq^ 
Lb ({ )) = true, which is exactly IB. Therefore we will now usually incorporate 
the proof of IB in that of IS2. 

Lemma 3.14. The derivation relation =>t is confluent on SF{T), 

Proof. By Lemmas 3.9 and 3.13, =>t is locally confluent and terminating, 
respectively. Hence, by Lemma 2.5, =^t is also confluent. □ 

The following easy technical lemma is only a preparation for showing that 
the normal form of every sentential form is in T^. 

Lemma 3.15. Let 6 € / > 0, and ^i, . . . ,^/ G SF{T). If ^i, . . . ,^/ E 

SF{T) have normal forms, then 

nf{=>Ty6{^i, • • - , 6 )) = ^(^/(=>^ t , 6 ), • • - 

Proof. Our statement follows from the fact that a rule can only be applied 
to a sentential form in such a place where a state occurs. Hence, the symbol 
6 at the root of 5(^i, ... ,6) remains unchanged during the derivation steps. 
□ 



The following theorem is the basis of defining the tree transformation 
induced by a top-down tree transducer. 

Theorem 3.16. For every sentential form (p G SF(T), the normal form 
nf{=^Ti^) exists. Moreover, n/(=^T)^) € Ta- 

Proof. By Lemmas 3.13 and 3.14, the relation =>t is confluent and termi- 
nating on SF{T), Then, by Corollary 2.7, the normal form nf(=^Ti^) exists, 
for every <p G SF{T). 

In order to prove that n/(=^T,^) G T^, we define the predicates K and 
L and then show by simultaneous induction that they hold. 

K : For every q E Q and s = cr{si, . . . , Sjb) G Ts, it holds that n/(=^T 
,q(s)) E Ta- 

L : For every Ar > 0, ^ G RHS{Q^A^k), and si,...,sa; G T^, it holds 
that n/(=^T,^[ari ^ «i, . . . , ajjb ^ Sk]) G Ta- (Here we use that, by Lemma 
3.7, ^[a:i ^ 5i, . . . , x* sjb] G SF{T).) 

First we show ISl. Let ^ be the right-hand side of the (g,(r)-rule in R. 
Then, by the definition of we have ^(s) =>t ^ , . . . , Sk] and, 

by Corollary 2.8, it holds that nf(=^T.q(s)) = nf(=>T.(\x^ si xu 
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sjb]). Since, by the induction hypothesis on L, nf{^Ti^[^i ^ ^ 

sjk]) G we get that n/(=^T) 9(«)) is also in T^. 

The proof of IS2 (and IB) can be done by induction on the structure of 

(i) Let ^ = p{xi), for somep G Q and 1 < i < fc. Then ^[xi si , . . . , xjb ^ 
Sk] - P(si) and thus n/(=^T,^[a?i ^ si,,..,Xk ^ «*]) = n/(=^T,p(si))- 
Since, by the induction hypothesis on nf(=>T,p{si)) G T^, we also have 

^ si,, . ,,Xk *- sjb]) G Ta- 

(ii) Now let ^ = 5(^i, . . . ,^/), for some 6 G with / > 0. Then, ^[a:i +— 

Si,...,Xk ^ Sk] = ^ Si,,.,,Xk ^ Sjb], • • ^ Si,...,Xk Sjb]) 

and thus, by Lemma 3.15, 

^/(=>T,^[a?i ^si,..,,Xk ^Sk]) = 

6{nf{=>T,^i[xi ^si,...,xjk ^sjb]),... 

• • • ,^/(=>'T,6[iCl ^ Sl, . . .,Xjb ^ Sk])). 

On the other hand, by induction hypothesis on for every 1 < i < /, we 
have nf{:^T,^i[xi ^ si,...,xjb ^ Sk]) G T^, and thus n/(=^T,^bi ♦- 
si, . . .,xjb ^ Sk]) G Ta. 

Now our lemma follows from the fact that L holds and that, by Lemma 3.7, 
each sentential form (p can be written in the form ^[xi si, . . .,xjb ^ sjk], 
for some A: > 0, ^ G RHS{Q, A, Ar), and «i, . . . , sjk G Ti?. □ 

We can now define the tree transformation induced by a top-down tree 
transducer. In fact, it is a mapping that takes every input tree s eTs to the 
normal form of the initial sentential form qo{s). 

Definition 3.17. Let g G Q be a state of T. Then 

(a) The tree transformation induced by T with q is the mapping : 

Ts — ► Ta such that, for every s G T^;, we define = n/(=^Tj q{s)). 

(b) The tree transformation induced by T is the mapping □ 

We call a tree transformation a top-down tree transformation if it can be 
induced by a top-down tree transducer. Moreover, the class of all top-down 
tree transformations is denoted by TOP. 

By Theorem 3.16, the normal form of a sentential form p is in Ta. On the 
other hand, if ^ is a sentential form and, for some s G we have p s, 
then clearly s is irreducible and hence it is a normal form of p. In conclusion 
this means that p some s G T^, if and only if s is a normal form 

of p. Therefore the tree transformation induced by T can alternatively be 
written in the following form, which is generally used in the theory of tree 
transformation: 



Ty®’’ = {(r, s)€Ts xTa 1 qo{r) =>-y s}. 
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To finish this section, we give an example of a tree transformation. 

Example 3.18. It can be easily seen that the tree transformation induced by 
T' in Example 3.3 is 

r^r = {(a,a)}U{(7”(a),5(7'*-HA).7"-^(/?2))) I n > 1}. 

In the rest of the book we will call this tree transformation Tchain- to- chains 
(Fig. 3.3). □ 
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Fig. 3.3. The tree transformation Tchain —to— chains 



3.3 Characterization of Top-Down Tree Transformations 

The aim of this section is to introduce certain inductively defined functions for 
a top-down tree transducer which will be very close to the tree transformation 
induced by it and will serve as both a precise and convenient tool for proving 
statements concerning top-down tree transformations. 

First we present the principle of definition by simultaneous induction 
which can be viewed as an extension of the principle of definition by structural 
induction (see Chap. 2). Then, using this new principle, we define two families 
of functions concerning a top-down tree transducer and show that they can 
also be expressed by normal forms of sentential forms. Hence they also define 
the tree transformation induced by a top-down tree transducer. 

We start with the following definition. 

Definition 3.19. Let 17 be a ranked alphabet and let C be a set. Let A be 
a set and, for every a £ let fa : Ts C he a, function. Moreover, for 
every Ar > 0, let Bk be a set and, for every b £ Bk^ let gb^k • ► C be a 

function. 
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Let 

F = {fa\aeA} 



and let 



G = {gb,k \ b e Bk for some k > 0}. 



We say that F is defined if, for every a E A and s E Tjj, fa{s) is defined. 
Similarly, we say that G is defined if, for every k >0jb E Bk and , . . . , Sjfc E 
Ts, ^ j ^k)) are defined. □ 



Definition by simultaneous induction is a method to define simultaneously 
families of functions like F and G. This method generalizes the principle of 
definition by structural induction (Principle 2.11). The principle of definition 
by simultaneous induction is as follows. 

Principle 3.20. Let F and G be two families of functions as in Def. 3.19. 
The principle of definition by simultaneous induction is: 

If 

IB: for every b E 5q, the value gb,o{{)) is defined, and 
ISl: for every a G A, fc > 0, and si,...,Sjk E Tjs;, if for every 
b E Bk, gb,ki(si,. . .jSk)) is defined, then, for every a E 
/a(^(si, . . . , Sjb)) is defined, and 

IS2: for every fc > 1 and si,...,sjb E Ts, if for every ai,...,aA: E 
A, fai (Sl), • • • , fajc (sk) are defined, then, for every b E Bk, 
P6,Jk((«i, . . is defined, 

then F and G are defined. □ 

The abbreviations IB, ISl, and IS2 stand for induction base, induction step 
1, and induction step 2, respectively. 

Similarly to the principle of proof by simultaneous induction, when we 
define F and G by simultaneous induction, the items IB and IS2 can usually 
be incorporated into one step. Namely, IB can be incorporated into IS2 by 
extending IS2 to the case A: = 0. In fact, IS2 in the case k = 0 means that, 
for every 6 G Bo, fl^fc,o(()) is defined, which is exactly IB. 

Note 3.21. Similarly to Principle 3.11 in concrete applications of Principle 
3.20, the sets Bjb, fc > 0 will form a hierarchy. Moreover, for every A: > 0 and 
b E Bk, the functions gb^k will have the property that, for every I > k and 
sequence , . . . , s/ of trees in Ts , 

gb,k({si , . . . , sjb)) = gb,i{{s \ , . . . , si)). 

The arity of a function gb^k will always be clear from the context, namely 
from the specification of its arguments, therefore we drop the index k from 
gb^k in concrete applications of Principle 3.20. 

Using this principle, we define two families of functions concerning a top- 
down tree transducer as follows. 
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Definition 3.22. Let T = {Q^E^A^qo^R) be a top-down tree transducer. 
We define families 

= \ q€Q} 

and 

Gt = {4"e = I ^ > 0- ^ € RHSiQ, A, k)} 

of functions by simultaneous induction as follows. 

IB: For every ^ G RHS{Q, A, 0), let )) = ^ . 

ISl: For every cr G with fc > 0, si, . . . , Sjb G T^;, and g G Q, we define 
)•••)«*)) = ((«i ) • • • ) ^k)), where ^ = rhs{q, a). 

IS2: For every fc > 1, ^ G RHS{Q,A,k), and u = (si,...,^fc) € we 

define by induction on the structure of ^ as follows: 

(i) If ^ = p{^i) for some p £ Q and 1 < i < Ar, then = r^J{si)- 

(ii) If ^ for some 6 G A^^^ with / > 0 and G 

RHS{Q, A, k), then 4 -/( 0 ;) = 6 ( 4 -/^ (a;), . . . , 4 -/( 0 ;)). □ 

We note that in the particular case ^ G it holds that 
whatever k is. This is always the case, in particular if fc = 0. Therefore, 
for every s = <7 G E^^^ and q E Q, it holds that rjP^(s) = where ( = 
rhs(qj a) G RIIS(Qj Aj 0) = T^. 

In the following we give an example for a computation of the functions 

^ind 

Example 3. S3, Let T' be the tree transducer defined in Example 3,3. We 
compute 4 ?^^^ (72 (a)) . 

= (t(»)), 4”, ft (T(a))) 

= «(4?.‘'.(ft(x, ))((«)). -^:'.(ft(xO)((“))) 

= ^(T(4"fft(x, )((«))). t( 4" ft(xi)((“)))) 

= %(4"!|,i(«)).T(4”ft(«))) 

= ^(t(4’?S.(()))-^(^^S.(0))) 

= 6(7(/?i).t(;02)) 



We now show that the functions defined by normal forms, and r*-^, 
defined by simultaneous induction, for a top-down tree transducer T coincide. 
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Since the proof will be performed by simultaneous induction, we also require 
the missing function 

Definition 3.24. Let T = {QjE,Ayqo,R) be a top-down tree transducer, 
> 0) ^ ^ RHS{Qj A, k). We define the function : (Ts)^ Ta for every 
uj = (si, . ..,Sk)e by = n/(=>T,^[a?i ^ 5i, . . . , a:* ^ Sjb]). □ 

Now the proof of the equality of and follows. We recall from Lemma 
3.7 that ^ [a?i , . . . , xjk ^ s*] E SF{T). 

Theorem 3.25. Let T = {Q,U,A,qo,R) be a top-down tree transducer. 
Then the predicates K and L hold, where 

K : For every g E Q and s = r^^q{s)- 

L : For every ^ E RHS{Q, Zi, k) with k >0 and = (si, . . . , sjb) E 
(T^)^ (a;) = 



Proof. We prove by simultaneous induction and start by proving ISl. Let 
g E = cr(5i, • • - for some <r E k > 0, and si, . . .,Sjb E Tjj, 

Moreover, let cj = (si, . . . , s*). First we note that, by Def. 3.4, it holds that 



«(s) =^T ^[a!i ^ Si, . . . ,xt <- Si], where ^ 
as follows: 


= rhs{q, <r). Then we can compute 


= «/(=J'T,g(s)) 


(by Def. 3.17) 


= nf{=i-T,^[xi<-si,... 




. . . ,Xi <— Si]) 


(by Corollary 2.8) 


= 


(by Def. 3.24) 


= 


(by induction hypothesis on L) 


= 


(by Def. 3.22). 



Now we show IS2 by induction on the structure of 

(i) Let ^ = p{xi), for somep E Q and I <i < k. Then ^[x\ ^ si , . . . , xjb ^ 
Sk] = p{si) and thus n/(=>T,^[iPi ^ si,...,Xk ^ Sk]) = n/(=>T,p(«i))- 
Hence we obtain the following: 



^der 

^T,e 



(w) 



...,Xk <-Sife]) 

nfi^T,p{si)) 

4%{si) 



(by Def. 3.24) 

(by Def. 3.17) 

(by induction hypothesis on 7C) 
(by Def. 3.22). 



(ii) Let now^ = ^(6> • • • for some 5 € with / > 0, and^i, . . . € 

RHS{Q,A,k). First we note that ^[xi <— si,...,Xfc <— s*,] = 5(^i[xi <— 
si,...,Xk *- Sfc],. *-si,...,Xk^ sit]) and therefore 
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nf{=>-T,^[xi <- Si art ^ sj,]) = 

nf{^T,^l[xi *— Si,...,Xk *— Si])) 



Thus we get the following. 



= nf{=^T,^[xi ^ si,...,Xk *- Sk]) 

(by Def. 3.24) 

= H‘^fi=>T,^i[xi <-Si,...,ari <-Si]),... 

■■■,nfi=^T,h[xi *-si,...,Xk <-Si])) 

= (by Def. 3.24) 

= ^(a 7 ’"f,(w), . . . , Ty"^(w)) (by induction hypothesis on £.) 

= 4”/(w) (by Def. 3.22). 



We have proved that the two function families defined by means of normal 
forms and by simultaneous induction are the same. Therefore, in the rest 
of the chapter we drop the superscripts der and ind when we refer to the 
functions related to a top-down tree transducer. However, it should be clear 
from the context which definition of the function we are working with. 



3.4 Height Property 

In this section we show that the height of the output tree 7 ^( 5 ) associated to 
an input tree s by a top-down tree transducer T is linearly bounded by the 
height of 5. 

We need the following technical lemma which is essentially the general- 
ization of the observation made after Def. 3.22. 

Lemma 3.26. Let T = (Q, i7, Zi, go) be a top-down tree transducer. 
Then, for every Ar > 0, ^ G RHS{Q,A,k), and lj = (5i,...,sjb) € (Ti?)*, 
it holds that ^ TT^q{sj) ; q{xj) G Q{Xk)] where Q{Xk) 

denotes the set {g(a?i) | G Q, 1 < i < fe}. 

Proof. We prove by structural induction on 

(i) Let ^ = p{xi) for some p E Q and 1 < i < k. Then = TT^p{si) 

by the definition of 7 ^,^. On the other hand, 

TT.p(s<) = P{xi)[q{xj)*-vr,q{^j)>9ixj)eQiXk)] 

(by the definition of <— ) 

= rr,g(sj) ; q(xj) G <3(A:j:)] 

(by ^ = p(a!i)). 
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(ii) Now let f = 6(^i, . . for some 6 G with / > 0, and ^i, . . . G 
RHS{Q, A,k), Then we have 

(by Def. 3.22) 

= <- rr.«(si) ; q{xj) e . . . 

• • • .6[9(®j) ^ TrAsj) ; q{xj) G Q(X|fc)]) 

(by induction hypothesis on 
= 5(6, • • • ,6)[9(a?j) <- rr,,(sj) ; q(xj) G Q(X*)] 

= 69(®i) — n’,«(«i) : 9(a:i) G Q(Xfc)]. 

□ 

We can now prove the relation mentioned earlier between the input and 
output trees of a top-down tree transducer. 

Lemma 3.27. Let T = {Q, U, A^qo^ R) be a top-down tree transducer. 
There is an integer c > 0 such that, for every s £ Tsy height{vr{s)) < 
c • height{s). 

Proof. Let c be the maximal height of the right-hand sides of rules in R. We 
show that, for every q £ Q and s G Ts^ height{vr^q{s)) < c • height{s). We 
prove by induction on s. 

(i) Let s = (7 G and let ^ = rhs(g,(T). Then vr^q{s) = ^ and we 
obtain that height{vr^q{s)) = height{^) < c- height{s). 

(ii) Now let s = (t(si, . . . , Sk), for some k >1, a E and si, . . . , sjb G 

Tjj. Moreover, again let ^ = r/is(g, cr). Then we can compute as follows, where 
again Q{Xk) denotes the set {g(ar,) \ q £ < i < k}, 

height{Tr,,{s)) = height{^[q{xj) <- rr,,{sj) ; q{xj) £ Q{Xk)]) 

(by Lemma 3.26) 

< height{^)+ 

max{height{vr,q{sj)) 1 q{xj) e Q{Xk)] 

(by the definition of height) 

< c-^max{height{vr^q{sj)) | q{xj)eQ{Xk)} 

(by the definition of c) 

< c -h max{c • height{sj) \ I < j < k} 

(by induction hypothesis) 

= c • (1 -f max{height{sj ) I 1 < i < *}) 

= c • heighi{s). 

□ 



3.5 Subclasses of TOP 

In this section we introduce and study restricted versions of top-down tree 
transducers. These will induce certain subclasses of TOP. Actually, we im- 
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pose three kinds of restrictions and obtain the concepts of a homomorphism 
tree transducer, a linear top-down tree transducer and a superlinear top-down 
tree transducer. We give an example of the tree transformation induced by 
each of the restricted versions. Finally we compare the tree transformation 
power of the versions mentioned by giving an inclusion diagram of the tree 
transformation classes which they induce. 

We start with defining the restrictions. 

Definition 3.28. Let T = (Q^S^A^qo^R) be a top-down tree transducer. 
Then 

(a) T is a homomorphism tree transducer if it has only one state, i.e., if Q = 
{go}- A tree transformation is called a homomorphism tree transformation 
if it can be induced by a homomorphism tree transducer. The class of all 
homomorphism tree transformations is denoted by HOM, 

(b) T is linear if, for every rule g((r(a?i, . . . , a?jb)) —*^^ inR and every 1 < 
i < k, the variable X{ occurs at most once in ^ . A tree transformation is called 
a linear top-down tree transformation if it can be induced by a linear top- 
down tree transducer. The class of all linear top-down tree transformations 
is denoted by l-TOP. 

(c) T is superlinear if it is linear and, for every a G with A: > 0 and 

q^q' ^ Q with q ^ g', Varx{rhs{q,a)) fl Varx{rhs{q\(r)) = 0. Informally, 
this property means that for every a G with A: > 0 and every 1 < 
i < kj there is at most one q E Q such that Xi occurs in rhs{q,(T), A tree 
transformation is called superlinear top-down tree transformation if it can be 
induced by a superlinear top-down tree transducer. The class of all superlinear 
top-down tree transformations is denoted by sl-TOP. □ 

The attributes linear and superlinear can also be applied to homomorphism 
tree transducers. Thus we also have classes l-HOM and sl-HOM. 

In the following we give examples for the tree transformations induced 
by each of the above restricted top-down tree transducers. First we present 
a homomorphism tree transducer. 

Example 3,S9, Let the top-down tree transducer T = ({g}, i7, Zi, g, iJ) be 
defined as follows: 

• R consists of the rules 

1) 5(7(®i)) -^o’(9(«i),9(a:i)), 

2) ?(a)-+a. 

Then obviously T is a homomorphism tree transducer, hence vr € HOM. A 
derivation by T looks as follows: 

9(t^(“)) =^t <T(g(7(a)),g(7(a))) =>t <T(«r(g(a),(ir(a)),g(7(Q))) 

=!^\. <r(<r(a,a),g( 7 (a))) <r(<r(a;,a),<T(ar,a)). 
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Now we explicitly give vr. For this, let us denote the fully balanced tree over 
A of height n by Sn, i e-j 

(i) let Si = a, and 

(ii) let Sn+i = for every n > 1. 

Then we can prove by induction on n that, for every n > 0, we have 

= «n+i: 

(i) If n = 0, then q{j^{a)) = q{a) =>t cr = si, hence vr{j^{ot)) = Si, 

(ii) Let n > 0. Then q{y^{oi)) =>t 9(7'^”^(cr))) and, by 

induction hypothesis and Theorem 3.25, 9(7”"’^ (cr)) =>t Hence we get 
g(7"‘(a)) =4^5. cr(sn,«n) = «n+i, meaning that t^( 7”(«)) = «n+i* 

Thus the tree transformation induced by T is 

VT = {(7 "(q:),s„+i) I n > 0}, 

(Fig. 3.4). We call this tree transformation Tdoubie- ^ 



”7 

! '^double ^ ^ 


" (7 “ 

/ \ 


i 

-7 


/ \ 

_ Cr <T _ 



i— —i 

/ \ / \ 



a 



a a a 

I 

2” times a 



a 



Fig. 3.4. The tree transformation Tdoubie 



The following example shows a linear top-down tree transducer. 

Example 3.30. Consider the top-down tree transducer 
T={{qi,q2},S,AquR), where 

• i: = A = 

• R is the set of the rules 

1) 91(7(2:1)) ^71(92(3:1)), 

2) 92(7(2:1)) -^72(91(2:1)). 

3) gi(a)^a, 

4) 92(a) ^ a. 
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Then T is a linear top-down tree transducer and hence t>t € l-TOP. 

At the same time, T is not superlinear because x\ appears in both 
rhs(^i, 7 ) and rhs(g 2 jT)- A derivation for T looks as follows: 

9 i ( t ^(“)) =^t 71(92(7^(0;))) 71(72(91(7(0)))) 

=>T 71(72(71(92(0)))) 71(72(71(0))). 

Again, we are going to specify the tree transformation induced by T. We 
define the trees si,m, « 2 ,m, for each m > 1, by induction on m as follows: 

(i) Let si,i = S 2 ,i = o. 

(ii) For m > 1, let = 7i(«2,m) and S2.m+i = 72(si,m). 

Then, it should be clear that the tree transformation induced by T is 
= {(r, Si,m) \r€Ts,m = height(r)}. 

The above tree transformation will be called Taitemate, (Fig. 3.5). □ 
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Fig. 3.5. The tree transformation Taitemate 



The third example is a superlinear top-down tree transducer. 

Example 3.31. Let T = ({gi, ^ 2 }, be the top-down tree trans- 
ducer, where 

. i:= {6(2),aW}, 4 = {1(1), 2(1), aW} 

• R is the set of the four rules 

1) gi(5(a;i,a:2)) l(92(a;i)), 

2 ) qi(6(x^,X 9)) —*2(a^(xo)). 
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3) gi(a) -+ a, 

4) q 2 {a) -* a. 

Clearly T is a superlinear top-down tree transducer and thus vr G sUTOP. 
First we present a derivation by T: 



?i( 6 ( 5 (a,a),a)) =»t 1(92(5(0;, «))) =^t l( 2 (gi(a))) => l( 2 (a)). 

Next we give vr. We define, for i = {1,2}, the z-path Wi G of a tree 
r G Tj;, by structural induction as follows: 

(i) If r = a, then Wi = a, 

(ii) Let r = 6 (ri,r 2 ), where ri,r 2 G Tj;, and let the i-path of Vj be Vjj, for 
j — {1,2}. Then we define w\ = 1 ^ 2 , i and W 2 = 2 i;i, 2 - 

Intuitively, the z-path of a tree r = 5(ri, r 2 ) G Ts can be obtained by taking 
the z-th branch at the root 6 of r and then taking the (zmod( 2 ) -|- l)-path of 
the subtree r,- we reached. (For example, let r = 6 ( 6 (a, a), a). Then the 1- 
path of r is 12(a) and the 2-path of it is 2(a).) It can be proved by structural 
induction that, for each r G Tj;, we have 'Pr,gi(^) as the z-path of r. Hence 
we obtain 



Vr = {(r, w) I r eTs and w is the 1 -path of r }. 

We will call this tree transformation Vzigzag (Fig. 3.6). □ 
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We devote the rest of this section to comparing the tree transformational 
capacity of the variants of top-down tree transducers we have defined up 
to now. We perform the comparison by giving the inclusion diagram of the 
set of the six classes TOP, /-TOP, s/-TOP, HOM, UHOM^ and sl-HOM 
above, with respect to inclusion. We construct the inclusion diagram with the 
method presented in Sect. 2.2, i.e., we first give a conjecture for the inclusion 
diagram and then prove that it is the inclusion diagram. We recall that the 
proof involves determining the minimal set of inequalities that are necessary 
to prove that the conjectured diagram is the inclusion diagram. 

We start by giving some inclusions and one equality which are immediate 



consequences of Def. 3.28. 
Corollary 3.32. 1) HOM 


C 


TOP 


2) 


l-TOP 


c 


TOP 


3) 


sl-TOP 


c 


l-TOP 


4) 


l-HOM 


c 


HOM 


5) 


sl-HOM 


c 


sl-TOP 


6) 


sl-HOM 


= 


l-HOM. 



Proof. The inclusions 1) - 5) and sl-HOM C l-HOM hold by definition. We 
only have to prove the converse of the last one. 

Therefore, take a linear homomorphism tree transducer 
T = {{qo}jS^A,qo^R) and an input symbol a G with fc > 0. Then T 
is obviously superlinear because there are no two different states in {go}) see 
(c) of Def. 3.28. Hence l-HOM C sl-HOM, too. □ 

We now give a conjecture for the inclusion diagram. We conjecture that 
the inclusion diagram of the set of the six classes we consider is that which 
can be seen in Fig. 3.7. We now prove that this is true. 

We first note that every inclusion which is shown by the diagram has 
already been proved in Corollary 3.32. Moreover, we observe that the in- 
equalities which are necessary to prove are as follows: 

TOP - HOM 0 

HOM - sl-HOM 0 

TOP - l-TOP # 0 

l-TOP - sl-TOP / 0 

sl-TOP - sl-HOM ^ 0 

HOM - sl-TOP ^ 0 

HOM - l-TOP ^ 0 

sl-TOP - HOM i: 0 

l-TOP - HOM ^ 0. 

Then it is straightforward to see that the minimal inequalities are the follow- 
insc. 
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HOM - l-TOP ^ 0 

sl-TOP - HOM ^ 0 

UTOP - S/-TOP ^ 0 

Next we prove the first one. 

Lemma 3.33. HOM — l-TOP / 0. 

Proof. We prove by contradiction that the tree transformation Tdoubie ap- 
pearing in Example 3.29 cannot be induced by any linear top-down tree trans- 
ducer. Therefore, assume that Tdoubie = for some linear top-down tree 
transducer T' = (Q, i7, go? H!)- Take an arbitrary element of Tj;, which, of 
course, has the form 7 ^( 0 ), for some n > 0. Since Tdoubie = 'Trs there is the 
derivation qo{j^{cc)) «n+i- Suppose that the mth step of this derivation 
is the first such one in which the right-hand side of the rule applied for 7 is 
not of the form p(xi), for some p £ Q. Then the involved derivation can be 
shown as 

«o(7"(«)) =^T' «»+i, 

where p SQ, p{xi) is the right-hand side of the rule applied in the (m— l)th 
step and p{j{xi)) — ^ ^ is the rule applied in the mth step. 

Next we show that ^ contains at least one occurrence of a. Actually, 
either ^ = a or ^ contains at least one occurrence of a. In the first case our 
statement obviously holds. Considering the second case we observe, on the 
one hand, that a has rank 2 and, on the other, that xi can occur at most 
once in ^ because T' is linear. However these two conditions mean that ^ 
should contain at least one occurrence of a in the second case, too. 

So there is at least one occurrence of a in ^ anyway. Our next observation 
is that the lengths of all paths in ^ leading from its root to occurrences of a 
must be the same (specifically they must be n + 1 , because the length of all 
such paths in Sn+i is n -h 1 ). 

Finally, let n' > n and consider the derivation 

go(7"'(«)) ^[®i - 7”'-'"(«)] =^T' 

Then there will be paths of length n -h 1 in Sn'+i leading from its root to 
occurrences of a, because there are such paths in due to our observation. 
However, this is a contradiction because the length of every such path in 
Sn/+i must be n' + 1 > n + 1 . 

Hence Tdoubie cannot be induced by a linear top-down tree transducer. □ 
Corollary 3.34. HOM - sl-TOP 96 0. 
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Proof. The statement is an obvious consequence of Corollary 3.32 and 
Lemma 3.33. □ 

We now prove the second minimal inequality. 



Lemma 3.35. sl-TOP — HOM ^ 0. 

Proof. We show that the tree transformation r^igzag, see Example 3.31, 
cannot be induced by any homomorphism tree transducer. 

Again, we prove by contradiction. Assume that there is a homomorphism 
tree transducer T' = ({p}, i^, -4,p, iJ'), such that Tzigzag = Moreover, let 

R' = {pW®i.a:2)) -►^,p(q;) -►C}- 

Concerning the form of three different cases are possible, namely, either 

- C G or 

- x\ occurs in ^ but X 2 does not, or 

- a ?2 occurs in ^ but x\ does not. 

(Note that x\ and a ?2 cannot both occur in because all symbols in A have 
rank at most 1.) 

We show that all cases are impossible. 

Case 1: ^ G Then, for every input tree r = 5(ri,r2) G we have 
p(r) = p(^(ri, r 2 )) showing that 

= {(q^,C)} U {(r,0 I r = 5(ri, r 2 ), for some ri, r 2 G Ts). 

However this contradicts Tzigzag = see Example 3.31. 

Case 2: x\ occurs in ^ but X 2 does not. Then, obviously ^ = vp(a?i), for 
some V G Now, take an input tree r = 6(5(a, ri), a), where r\ G Ts 

is an arbitrary tree. For this r, we have 



p(r) =p(5(^(a, ri),^)) =>t/ vp{8(a^r\)) vvp{a) 

meaning that 7^/(r) = vvC does not depend on r\. On the other hand, as was 
shown in Example 3.31, Tzigzag(T) = 12u;, where w is the 1-path of ri, which 
of course does depend on ri. This contradicts Tzigzag = tT'- 

Case 3: X 2 occurs in ^ but x\ does not. In this case ^ = vp{x 2 )^ for some 
V G Take an input tree r = ^(ri,a), where r\ G Ts is arbitrary. 

Then we can compute as follows 

p{r) — p(6(ri, a)) vp{oi) =>t' 

which means that 7^/(r) = vC for every ri G T^;. On the other hand, 
Tzigzag(T) = lu;, where w is the 2-path of ri, which contradicts Tzigzag = tt>- 
With this we have proved that Tzigzag cannot be induced by any homo- 
morphism tree transducer. n 
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Corollary 3.36. /-TOP - HOM ^ 0. 

Proof. The statement immediately follows from Corollary 3.32 and Lemma 
3.35. □ 




Fig. 3.7. The inclusion diagram of TOP and all its subclasses 



We now state and prove that the conjectured diagram is the inclusion 
diagram of the six tree transformation classes involved. 

Theorem 3.37. The diagram in Fig. 3.7 is the inclusion diagram of the tree 
transformation classes TOP, /-TOP, sl-TOP, HOM^ l-HOM, and sUHOM. 

Proof. We have already proved all the inclusions and two inequalities which 
are necessary to verify our statement. 

The only element which remains to be proved is the inequality /-TOP— s/- 
TOP ^ 0. This will follow from two lemmas of the forthcoming section. In 
fact, in Lemma 3.41 we will prove that /-TOP is closed with respect to 
composition, while, in Lemma 3.43, we will show that sUTOP is not closed 
under composition. Hence l-TOP and sUTOP cannot be equal. □ 

Closing this section, we present a lemma which expresses that identical 
tree transformations can be induced already by the simplest top-down tree 
transducers we have considered. In the forthcoming sections we will use this 
lemma. 

Lemma 3.38. ID C sTHOM, i.e., for every ranked alphabet E, the iden- 
tical tree transformation Id{Ts) is in sl-HOM . 

Proof. Let E be an arbitrary ranked alphabet. Consider the top-down tree 
transducer T = ({g}, L*, E^ g, R) where R is the smallest set of rules such that 
for every a G E^^^ with fc > 0 the rule 
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q((T{xi (r(q{xi), q{xt)) 

is in R, Then we can prove that for every s £ Ts , vr{s) = hence vp = 
Id{Ts). Moreover, T is a linear homomorphism tree transducer and hence, 
by Corollary 3.32, it is in sUHOM^ too. □ 



3.6 Composition and Decomposition Results 

In this section composition and decomposition results concerning TOP and 
its subclasses will be proved. We speak about a composition theorem when it 
is proved that the composition of two (or sometimes more) tree transforma- 
tion classes is the subclass of a single tree transformation class. Conversely, 
if a tree transformation class is a subclass of the composition of some other 
tree transformation classes, we speak about decomposition results because in 
this case every tree transformation of that class can be decomposed into tree 
transformations in the other classes. In both cases equality can hold. This 
gives a kind of characterization of the single class which is on either side of 
that equation. 

In the first theorem we show that TOP is closed under composition, i.e., 
that the composition of two top-down tree transformations can always be 
induced by a top-down tree transducer. 

Theorem 3.39. TOP o TOP = TOP, 

Proof. By Lemma 3.38 and Corollary 3.32, ID C TOP. Hence, by Lemma 
2.14, TOPCTOPoTOP. 

The proof of TOP o TOP C TOP is more technical. It requires to 
construct, for any two top-down tree transducers T\ = {Qi,D,A,qi,Ri) 
and T 2 = {Q 2 ,A,n,q 2 ,R 2 )i a third top-down tree transducer T such that 

TTiOTt^ = Vp. 

First we modify T 2 by introducing the top-down tree transducer T 2 , so 
that T 2 can process the right-hand sides of rules in R\. Therefore we define 
mx = max{Ar | ^ 0} and the ranked alphabets 

I 9GQi,1< I < mx} 

and 

Q 2 X Q\{Xmx) = I (p,?) G <?2 X <5i,l < i < mx). 

We note that the ranked alphabets Qi{Xmx) and Q 2 x Qi(Xmx) contain 
only elements of rank 0 and that strings like and (p, denote 

individual symbols here. 

Then we put T 2 = {Q 2 , Qi{Xmx)y H ^ Q 2 x Qi(Xmx), 92, ^ 2 )> where 
i ?2 = i ?2 U {piq(xi)) (p, q)(xi) | p G O 2 and q(xi) G Q\(Xmx)}> 




88 



3. Top-Down Tree Transducers 



Next we prove the following statement, which shows the connection be- 
tween T 2 and T 2 . 

Statement For every p G Q 2 , ^ G RHS{Qi,A,k) with 0 < fc < mx, and 
family {tqj ETa \ 9 € Qi, 1 < i < A:} of trees, we have 

^ h.i : ?(®i) G Qi(^ifc)]) 

= Tr',p(^)[(p. 9)(xj) ^ ; (p, q){xj) G Q 2 x Qi(A’j:)]. 

We prove this statement by structural induction on 

(i) Let ^ = q{xi), for some g G Qi and let 1 < i < Ar. Then 

^ e Qi(Xjfc)]) 

= ^3.p(^«.«) 

= (P, «)(a:0[(p. 9)(®i) ^ > (P. 9)(®i) G Q 2 x Qi(Xfc)] 

= Pri,p(0[(P. 9)(®i) ♦- rra,p(i,-,i) ; (p, 9)(®i) G Q 2 x Qi(Xt)]. 

(ii) Let ^ = 5 (^ 1 , . . .,$»), for some 6 6 with / > 0, and G 

RHS{Qi,A,k). Moreover, let ^ be the right-hand side of the (p, 5)-rule in 
R 2 . Then we can compute as follows: 

^3,p(^[9(®j) ^ k,i ; 9(®i) G Qi(A’fc)]) 

= ■Tr,,p(^(6[9(!Ci) ^ tqj ; q{xj) e Qi(Xi)], . . . 

• • • ^q,j ) q{xj) G Ql(Xi)])) 

= ^ tq,j ; q{xj) G <9i(A-*)], . . . 

• • ,^i[dxj) i- tqj ; q{xj) € Qi(>’i'*)]) 

(by Def. 3.22) 

= ‘^2,q{^i[q{^j) ^ ^q,i > q(^j) G Qi(Xi;)]) ; ?(*») G Qi(.?ffe)] 

(by Lemma 3.26) 

= ([q(xi) <- PT',f(^i)[(P, «)(*;•) ^ 

^ 3 >p(^fj);(p. 9 )(®i) G Q 2 X Qi(Xt)] ; q(x{) G Qi(Xt)] 

(by induction hypothesis) 

= ^[ 9 (a ;0 ^ TT^.qiij); q(xi) G Qi(Xfc)][(p, q)(xj) ^ 

(p. 9)(®j) G Q 2 X <3i(Xjfc)] 

(by the associative property of ^) 

= ^,)[(p, q)(xj) ^ ; (p, q)(xj) € Q 2 x Ql(Xi)] 

(by Lemma 3.26) 

= tt',p(0[(p. 9)(®j) ^ ^2.pih.j ) ; (p. 9)(iCi) G <92 X Qi(Xjb)] 

(by Def. 3.22). O 

We now define the top-down tree transducer T = (Q 2 x 

Qi, 17, 1?, ( 92 , 9i)) ^), where i2 is the smallest set of rules satisfying the 
following condition. If the rule q{a{xi, . , . ,Xk)) ^ is in i?i, for some 
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9 € Qi , G with fc > 0 and ^ G RHS{Q \ , -4, fc), then, for every p G Q 2 , 
the rule 

(P. 9)(o-(a!i, • • • . ®ife)) PT',p(0 

is in R. 

We note that T 2 can work on the right-hand sides of rules in R\ because, 
for every 0 < Ar < mar, we have RHS{Q\,A,k) C T^uQi(x„»,)- Moreover, it 
can be proved by an easy induction that, for every ^ G RHS{Q\,A, k) with 
0 < Ar < mx and p G Q 2 , it holds that Pr',p(0 G iJ/f5(Q2 x Qi, f?, Ar). Hence 
the definition of R is syntactically correct. 

To see that pr = o , we prove that the predicates K and L hold by 
simultaneous induction. 

K : For every p G Q 2 ,g G Qi, and s G Tsy it holds that T^ 3 ,p(PTi,g(s)) = 

L : For every 0 < A: < mx, ^ € RHS{Qi,A,k), p e Q 2 , and u = 

(«i, •••,«*) € (Ti)*, we have rr,,p(Pr,,«(w)) = ‘rr,e'(w), where = 

First we prove ISl. Let s = <r(si , . . . , s*) G for some ifc > 0, a € 

and «i, . . . , sjfc G T!e. Moreover, let ^ be the right-hand side of the (q, (r)-rule 
in Ri and let w = (sj , . . . , sj). Then we have 

^3.p(pti,j(s)) = •rr 3 .p(PTi.f(w)) (by Def. 3.22) 

= rr,(i{u) (by induction hypothesis on L), 

where = 7^',p(^). On the other hand, by the definition of T, is the 
right-hand side of the ((p, g), o-)-rule in R, and hence vr,(>{u) = Pr,(p, 4 )(s)- 
We now prove IS2. We compute as follows. 

Pr3.p(PTi.f(w)) 

= Pr„p(^[9(«j) ^ PTx,f(si) ; q{xj) G Qi(A’*)]) 

(by Lemma 3.26) 

= Tri,p(0[(P. 9)(®i) ^ rr,.p(Pr..f(sj)) ; (p, q){xj) G Q 2 x Qi(A’t)] 

(by the Statement above) 

= •rr',p(0[(p. 9)(®i) ^ ^,(p,f)(«i) ; (p. 9)(a;j) e Q 2 X Qi(Xit)] 

(by induction hypothesis on K) 

= Pr.e'(w) 

(by Lemma 3.26) 

where 

Now, since K holds for p = q2 and g = gi, we obtain Pr = PTi o pr^. With 
this we have proved that TOP o TOP C TOP. □ 



Next we show that HOM is also closed under composition. 
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Corollary 3.40. HOM o HOM = HOM. 

Proof. By Lemma 3.38 and by Lemma 3.32, we have ID C HOM, Hence, 
by Lemma 2.14, HOM C HOM o HOM. 

To prove the converse inclusion, we make the following observation. If in 
the proof of Theorem 3.39, the top-down tree transducers Ti and T 2 have the 
only states qi and ^ 2 , respectively, then T will have the only state (g 2 j^i), 
that is, T is also a homomorphism tree transducer. Hence, by 7 ^ = o 7 ^ 2 , 
we get HOM o HOM C HOM. □ 

We now show that UTOP is also closed under composition. 

Corollary 3.41. UTOP o UTOP = UTOP. 

Proof. By Lemma 3.38 and by Lemma 3.32, we have ID C l-TOP. Hence, 
by Lemma 2.14, l-TOP C l-TOP o l-TOP. 

The exact proof of l-TOP o l-TOP C l-TOP is more complex, although 
intuitively it can be seen easily. 

We prove that if, in the proof of Theorem 3.39, the tree transducers T\ 
and T 2 are linear, then T constructed from them is also linear. In other 
words, having the notations of Theorem 3.39, we should prove that, for every 
(p, g)eQ2x(3i,<T€ with Ar > 0 , and 1 < i < fc, the variable X{ appears 
at most once in rhs{{p^q),a). This is the consequence of the two following, 
slightly more general statements. 

Let p G Q 2 , ^ G RHS{Qi,A^k) with k > 0 and 1 < i < k. Then the 
following two statements hold. 

(a) Variables occurring in rx',p(0 ^tlso occur in 

(b) If T 2 is linear and x,- occurs at most once in then it occurs also at 
most once in 

The proof of both statements is performed by structural induction on ^ . 

(i) Let ^ = q{xj), for some q £ Qi and 1 < i < A:. Then = 

(p, g)(xj), hence both (a) and (b) hold independently whether T 2 is linear or 
not. 

(ii) Let ^ = 5(^1,..., foj some / > 0, 5 G and G 

RHS{Qi,A, k). Let, moreover, ^ be the right-hand side of the (p, 6 )-rule in 
R 2 - Then we have 

, • • • . 6)) = TT',e((6 . • • • . ^ 1 )) 

= (*) 

By the induction hypothesis, variables occurring in p(^j) also occur in 

for every p G Q 2 and 1 < j < /. Hence statement (a) holds. 

Now suppose that T 2 is linear and x,* occurs at most once in Then 
there is at most one 1 < i < /, such that Xj occurs in . Moreover, by the 
induction hypothesis, if x,- occurs in at most once, then it occurs at most 
once in 7’r^,p(^j), for every p G Q 2 * 

Since, for every p G Q 2 and 1 < n < /, variables occurring in rT^,p(^n) also 
occur in there is at most one 1 < n < /, such that x,- occurs in 7y',p(^n)- 
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On the other hand, Tj linear, because T 2 is, hence for every 1 < n < /, a 
subtree of the form p{xn) occurs at most once in Then it can be seen from 
(♦) that Xi occurs at most once in finishing the proof of (a) and (b). 

Now suppose that both Ti and T2 are linear. Let (p, q) E Q2 x Qi and 
a G for some k > 0. Then, for every I < i < k, X( occurs at most 

once in the right-hand side ^ of the (g, (r)-rule in R\. Moreover, by the above 
statement, for every I < i < k, the variable Xi occurs at most once in r7’^,p(0j 
which is, by the definition of T, the right-hand side of the ((p, g), (r)-rule in 
R. Hence T is linear. □ 

We are going to show that superlinear top-down tree transformations are 
not closed under composition. This will be the consequence of the following 
stronger result. 

Lemma 3.42. l-HOM o sUTOP — sUTOP ^ 0. 

Proof. We prove this in the following way. We define a linear homomorphism 
tree transducer T\ and then we show that vr^ovr ^ sl-TOP^ where T is the 
superlinear top-down tree transducer introduced in Example 3.31. 

To this end, let T\ = ({pi}, i7,pi, i?i), where 

• i7i = and E = and 

• R\ is the set consisting of the three rules 

- Pi(5(a:i , X 2 )) ^(pi(a;i), Pi(ar2)), 

- pi(a) a. 

It should be obvious that T\ induces the tree transformation vr^ : 

Ti 7 , where, for t E , we have vr^{t) = t\ if and only if t\ is obtained from 
t by deleting every occurrence of 7 from t. 

Then, the mapping o tt : ► Ta can be explained as follows. For 

every t G o 77’)(t) = if and only if is obtained from t in the 

following way: first form ti by deleting all occurrences of 7 from then take 
the 1-path of ti (see Example 3.31). 

We prove vp^ o vp ^ sUTOP by contradiction, i.e., we assume that 
vp^ o T>p = vpt for some superlinear top down tree transducer T' = 
(Q', Sly A, go, ^0- We study which kind of rules are in R'. 

Our first observation is that the rule go(<^) — ► 0 ( should be in /?', because 

Tpt{a) = Of. 

Next we observe that vpt(j(a)) = a. This implies that either go(T(a^i)) — ► 
a £ R' OT there is a state q E such that the rules go( 7 (^i)) 
q{a) — ► a are in R\ 

It is easy to see that the first case is impossible. Otherwise, if go( 7 ( 2 :i)) ^ 
a is in R\ then, for every t G Ti, we have go(7(0) ct, yielding that 
Tpf{y{t)) = a. However this is a contradiction because Tp'{j{t)) is the 1-path 
of ty by vp^ o Tp = Vpt. 
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Therefore 5 o(t(®i)) 9(^1) and q{a) — ► a are in ii' for some q E Q'. We 

show that necessarily q = qo. For this, we consider the (g,7)-rule, which, by 
the definition of a top-down tree transducer, must be in R. Let ^ = rhs{qj 7). 
We obtain easily that ^ cannot be in T^. If ^ G T^, then for every t 
we get qo{y^{t)) 9 (t( 0) ^ yielding that vrt{j^{t)) = for every 

i E Te. This cannot hold because 77^' (7^ (f)) is the 1-path of t. Thus xi 
should occur in However, since T' is superlinear, x\ can only occur in 
both rhs{qoyj) and rhs{q,j) in the case g = go* Hence we conclude that 
go(T(a?i)) go(ici) is also in R\ 

Next we observe that 77’/ (6(0:, a)) = la and that 77’/(5(5(a, a), a)) = 12a. 
It is an easy exercise to show that these two equations and the fact that T' is 
superlinear imply that qo{6{xi , X 2 )) — ► lp(a?i) must be in iJ', for some p G Q'. 
We show now that p / go* Otherwise, if p = go, then we have 

go(5(6(a,a),a)) =^t* lgo(5(o^» cr)) Hgo(a) =>T' 

contradicting 77’/(^(6(a, a), a)) = 12a. 

Hence p ^ go* Then xi cannot occur in rhs(p,7), because T' is super- 
linear and, as we already have seen, it occurs in rhs(go,7). Consequently, 
p(t(^i)) ^ is in R\ for some ^ G T^. 

However, we show that this is again impossible. If ^ G T^, then, for every 
f E Tj 7, we have go(^(7(f), <^)) lp(7(f)) 1^* On the other hand, 

T7’/(6(7(f), a)) = lu;, where w is the 2-path of t, which is a contradiction. 

With this we proved that 'Ppi ovr cannot be induced by any superlinear 
top-down tree transducer. □ 

Now we have obtained the following result. 

Theorem 3.43. The class sl-TOP is not closed under composition. 

Proof. By Lemma3.42 and by Corollary 3.32 we have sUTOPo sl-TOP — sl- 
TOP ^0. □ 

We will now prove two characterizations of TOP in terms of the compo- 
sition of its proper subclasses. In the proof of the first such theorem we will 
need to specify the right-hand sides of the rules of a top-down tree transducer 
in a more detailed form. Therefore we establish the following lemma. 

Lemma 3.44. Let T = (Q, i7. A, go, R) be a top-down tree transducer, k > 
0, and let ^ G RHS{Q, A,k). Assume, for every 1 < i < fc, the number of 
occurrences of X{ in ^ to be n». Let n = ni -h . . . + Then there exists a 
context t G Ca.ti and, for every 1 < * < fc, there exist states g,i, . . . , qim G Q, 
such that 

. . . , Zuj+l *— • • • , ^Uk+ni, ?ltnifc(2!fc)], 

where u,* = ni + . . . + nj_i for every 2 <i < k. (The statement is visualized 
in Fig. 3.8. Note that, for fc = 0, it is clear that n = 0 and t = ^ .) 
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Proof. Although the proof is rather technical, it can easily be performed by 
an induction on the structure of Therefore we omit the details. □ 
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Fig. 3.8. Detail of variables in ( 



The next theorem is a characterization of top-down tree transformations 
stating that top-down tree transformations are exactly those tree transfor- 
mations which can be obtained as the composition of a homomorphism tree 
transformation and a linear top-down tree transformation. 



Theorem 3.45. TOP = HOM o UTOP, 

Proof. The inclusion HOMol-TOP C TOP can be obtained by the following 
calculation: 

HOM o UTOP C TOP o TOP (by Corollary 3.32) 

= TOP (by Theorem 3.39). 

We show that the converse inclusion also holds. Therefore, let T = 
(Q, 17, -4, go> R) be a top-down tree transducer. We construct a homomor- 
phism tree transducer T\ = ({p}, i7, i7',p, i2i) and a linear top-down tree 
transducer T 2 = (Q 2 , -S", A, q 2 , R 2 ) such that vr = vr^ o vr^. 

Therefore let / be the number of rules in R. Let us fix an arbitrary linear 
order ri, . . . , r/ on the rules in R, Moreover, for every i > 1 and 1 < i < /, 
let n{iyj) be the number of the occurrences of a;, ’s in the jth rule rj. Let 

n = max{n{iy j) \ l < j < l,i> 1}. 

We first define Ti. Let i7' be the smallest ranked alphabet such that, for 
every A: > 0, = {< 7 ' | a G i7^*)}. Moreover, let R\ be the smallest set 

of rules such that, for every fe > 0 and a G the rule 
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n n 

/ ^ S / ^ M I I ^ 

p(<r(a:i, . . . , a;*)) a (p(xi), . . . ,p(a:i), . . • ,p(xfc), . . • ,P(aJit)) 

is in i?i, where every variable occurs n times on the right-hand side. (Note 
that, for Ar = 0, we have that p{a) — ► a* is in i2i.) 

We now define T 2 . Let Q 2 = Q, q2 = qo, and let R 2 be the set con- 
structed in the following way. For every 1 < j < /, let the jth rule rj in R be 
constructed as 

Tj = g(cr(a;i,...,a:t)) (♦) 

For brevity, let ni = n(l, j), . . = n{k,j) and n = ni + . . . + nt. By 

Lemma 3.44, there exists a context t G such that 

^ = t[z\ < 9ll(^l)) • • • ) ^ni ^ • • • 

. . . , ^ufc+1 ^ qkl{^k)i • • • ) ^Ufc+njt ^ qkriki^k)] 

where Ui is now equal to (i — 1) • n. 

Then, put the rule 

^((T (iCi, . . . , Xk.fi)) ► t[zi < • • • j Zfii * ^lni(^ni)) • • • 

• • • , ^tifc+1 ^ 9ikl(i«^(ib-l) n+l), • • • 

. . . , ^tifc+njk 9*nfc(^(A:-l) n+nfc)] 

in R 2 . (By the note we made in Lemma 3.44, in the case Ar = 0, we have 
and q{<T^) — ► ^ G ^2 ) 

Intuitively, the homomorphism tree transducer Ti handles the task of 
multiplying variables of T. Since T\ does not know how many times a variable 
must be copied, it makes n copies of every variable. This ensures that every 
subtree of an input tree to T\ is multiplied sufficiently many times. Then, T 2 
takes as many copies of a subtree as T has produced and operates on them 
in the same way as T did. The process by which T 2 chooses the necessary 
copies out of the n copies produced by T\ is visualized in Fig. 3.9. 





ni 


T»2 




nk 






hi 

1 


• • • qini 
1 


hi 

1 


• Q2n2 
1 


qki 
' 1 


. . qkrik 
1 




1 

Xi 


1 

. . . Xfii 


1 

Xuj+l .. 


1 

. X U2^n2 


1 

Xufc + l 


1 




T 


T 


T 


T 


’’ T 


T 




Xl 


. . . Xfii • • 


. Xn 3Jn+l • • ■ 


• • • • 2?2*n • ■ 


. . ®(fc_i).n+l • 


• • a;(fe-l) n+nfc • 


• • Xk’Tl 


















n 




n 




n 





Fig. 3.9. T 2 chooses the copies it needs out of the n copies produced by Ti 
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To show that vp = vr^ovr^, it is sufficient to prove that, for every s sTs 
and q £ Qj 'we have = 'rr 2 ,g(^i,p(«))- We prove this by structural 

induction on s. In order to avoid complex formulas, we put s' = ‘TTi.pC^)- 
Now let s = (7 G i^o- Then vr,q{s) = where ^ = rhs{q, a). On the other 
hand, s' = <t' and g((r') — ► ^ is a rule in J? 2 * Hence vr^,q{s^) = too. 

Now let s = <t(si, . . . ,Sjfc), where fc > 0, (T G and si, . . . ,sjk G Tj;. 
First compute vr,q{s). Assume that the (g,(7)-rule in R has the form as in 
(♦) and that all notations we defined for (♦) are valid for the rule involved. 
Then we have 

^.?(«) = Sk)) = ^[q{Xj) i- Tr,q{Sj); q{xj) € Q(Xib)] 

= t[zi ^ , Z„, <- (si), . . . 

• • • ) ^Uk+1 ^ • • • j ^u»+nii * (*<!)]• 

We now compute t^ 2 ,^(s'). We first observe that 
s' = 

where s|- occurs n times, for every 1 < i < fc. Hence 

• • • ) Zuic + l * • • • ) ^Ult+nic * ‘^3,qkn^ (®fc)]‘ 

Now, by the induction hypothesis, for every 1 < z < fc and 1 < j < n,-, 
^hus = ^ 2 ,g(^ 0 * completes the proof. 



Our next result is a decomposition result. It says that every linear top- 
down tree transformation can be obtained as the decomposition of a homo- 
morphism tree transformation and a superlinear top-down tree transforma- 
tion. 

Lemma 3.46. /-TOP C HOM o sUTOP. 

Proof. Let T = (Q, £*, ^, go, be a linear top-down tree transducer. 
Assume that R contains n rules. We define a homomorphism tree trans- 
ducer Ti = ({p},i7, L",p, Pi) and a superlinear top-down tree transducer 
T 2 = (Q 2 , 92 , ^ 2 ) such that vp = vr^ o Tr^. 

To this end, let ri, . . . , r„ be an arbitrary, but fixed order of the rules in 
P. 

We first define Ti. Let E' be the smallest ranked alphabet, for which 
*') = {(t' I (T G for every fc > 0. 

Then, let Pi be the smallest set of rules satisfying the following condition. 
For every fc > 0 and cr £ Eky let the rule 

n n 

^ s /' " ■ ' — s 

p(«T(ari, . ..,Xk)) <r'(p(a;i), . . . ,p(xi), . . .,p{xk), . • .,p(n)) 
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be in (Note that Xi occurs n times in rhs{p, < 7 ), for every 1 < i < fc). 

We now define T 2 . Let Q 2 = Q, 92 = 9o, and let R 2 be constructed as 
follows. For every 1 < i < n, consider the rule r,-. Assume that it has the 
form q{a{xi, , . .yXk)) — ► C for some fc > 0 and a G Then put the 

rule q{(T*{xij . . . , a?jb.n)) — ► in i? 2 , where the tree is obtained from ^ by 

substituting Xj in ^ by xy for every l< j <k. (More formally, we let 

* ®n+*) • • • j ®Jb ^ ®(jb— l)-n+*]*) 

We show that T 2 is superlinear. Since T is linear and, during the above 
construction, different variables in the right-hand side of a rule in R are 
substituted by different ones, T 2 is certainly linear. Moreover, take a (g, cr')- 
rule and a (g', <7')-rule from i? 2 i where a* G for some k>0, and g, g' G 

Q 2 are different states. Suppose that the (g, (r)-rule is the ith rule and the 
(g', (r)-rule is the jth rule in the sequence ri, . . . , r„. Obviously, i ^ j. Then, 
by the construction of i? 2 , Varx{rhs{qj a')) C {x,-, Xn+t, . . . , X(jb_i)„+,} and 
Varx{rhs{q'y a')) C {xj, Xn+j , . . . , X(ib_i)n+j}. Since, by i # j, 

• • • j ^(Jb— l)n+*} O {®j , • • • j = 0, 

we obtain that Varx{rhs{q, <T^))r\Varx{rhs(q\ a')) = 0. Hence, T 2 is super- 
linear. 

To prove that vr = t>Ti o vr^, it is enough to show that the following 
equivalence holds. For every s eTjj and q EQ, 'Pr,q{s) = The 

proof can be performed by an induction on the structure of s. We leave it for 
an exercise. □ 

To finish this section we give another characterization of top-down tree 
transformations in terms of homomorphism tree transformations and super- 
linear tree transformations. 

Corollary 3.47. TOP = HOM o sUTOP. 

Proof. 

TOP 

= HOM o l-TOP 
C HOM o HOM o sUTOP 
= HOM o sUTOP 
C HOM o l-TOP 
= TOP 

□ 



(by Lemma 3.45) 
(by Lemma 3.46) 
(by Corollary 3.40) 
(by Corollary 3.32) 
(by Lemma 3.45) 
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3.7 Composition Semigroup Generated by TOP^ 

UTOP, and HOM 

The motivation of this section 

We have so far considered five fundamental classes of top-down tree trans- 
formations: TOP,l-TOP,sl-TOP, HOM and UHOM. We have given their 
inclusion diagram in Sect. 3.5. Moreover we have obtained several compo- 
sition results and inclusions, proved in the Corollaries 3.32-3.47. We now 
observe that the knowledge we possess is sufficient to state the equality of, 
or the inclusion between, some further tree transformation classes which can 
be obtained by composition from the five fundamental classes. For example, 
let us take the tree transformation classes TOPoHOM oHOM oUTOP and 
HOM o TOP. Then we can easily prove that they are equal, that is to say, 

TOP o HOM o HOM o l-TOP = HOM o TOP. 

That is, we can show that both sides of the equality are equal to TOP by 
computing as follows: 



TOP o HOM o HOM o l-TOP 



= TOP o HOM ol-TOP 


(by Corollary 3.40) 


= TOP o TOP 


(by Theorem 3.45) 


= TOP 


(by Theorem 3.39) 


and 




TOP 





C HOM o TOP (by Lemma 2.14 and Lemma 3.38) 

C TOP o TOP (by Corollary 3.32) 

= TOP (by Theorem 3.39), 

proving also that HOM o TOP = TOP. 

On the other hand, let us consider, for example, the tree transformation 
classes HOM ol-TOP and l-TOP o HOM. We know that, by Theorem 3.45, 
HOM o l-TOP = TOP. Moreover, by Corollary 3.32 and Theorem 3.39, 
l-TOP o HOM C TOP o TOP = TOP. Hence we obtain 

l-TOP o HOM C HOM o l-TOP. 

However, at this point, we do not yet know whether proper inclusion or 
equality holds between these classes. 

This motivates us to deal with the following problem in this section. We 
would like to (and will) give an algorithm which can determine, given any 
two expressions built up with composition from some of our fundamental tree 
transformation classes, which relation out of C,D,= and 1X1 holds between 
them (recall that M stands for the incomparability relationship). 
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We structure the discussion as follows. First we abstract from the concrete 
choice of the fundamental classes, i.e., we consider instead an arbitrary finite 
set H, of tree transformation classes, and we present a general method which 
can construct an algorithm with the following property: given any two ex- 
pressions built up by composition from the elements of the algorithm can 
decide which of the relations holds between them. Then we apply the general 
method to a concrete choice of H consisting of four of our fundamental tree 
transformation classes. More precisely we carry out the two following main 
procedures, I and II, in this section. 

I. We give a general method which, given a finite set H of tree transfor- 
mation classes and given two tree transformation classes Ui o • • • o Um o,nd 
Vi o • • • o V^, such that, for every \ < i < m and 1 < i < n, Ui, Vj G H, 
determines which one of the following four conditions holds: 



(i) 


Uio.. 


■oUrr, 


= 


V'l 0- 


■oVn, 


(ii) 


Uio-- 


■oUrr^ 


c 


V^i o. 


■oV„, 


(iii) 


VlO- 


■oV„ 


c 


Uio. 


■■oUm 


(iv) 


UlO-- 


■oUm 




Vio-< 


■oV„, 



II. Then we apply this general method to the concrete set H = {TOP, I- 
TOP, HOM}, (We note that application to the whole set of our fundamental 
tree transformation classes would lead too far and exceed the scope of this 
book.) 

Presentation of the general method 

In this subsection we present the general method. The necessary notions of 
universal algebra were given in Sects. 2.1 and 2.5. However, in order to form 
the method, we need some more notions and notations as follows. 

Suppose that H is already given. We consider two semigroups in terms of H: 

• the free semigroup with the operation of concatenation (denoted here 
by •) and 

• the semigroup 



[H] = {Ui o • • • o Um I ^ > Ij G if , for every 1 < i < m} 

generated by H with the operation of composition (denoted by o). 

We also consider the homomorphism | | : [H] which is the unique 

extension of the identity mapping over H and for which, therefore. 
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for all elements ?7i , . . . , Um G H. In this way, strings over , i.e., elements of 
represent tree transformation classes in [H]. Finally, let us denote the 
kernel of | | by 0. 

We can now present the general method as follows. Before studying it, 
the reader is advised to consult Sect. 2.1 about the concept of a set of repre- 
sentatives. 

GENERAL METHOD: 

(a) Give a siring rewrite system S over H that presents [H], i.e., give a 
finite set S C H* x H* for which / Oj = [H], and such that = 0. 

(b) Give a set REP of representatives for the congruence classes of 

(c) Give the inclusion diagram of the subset \REP\ = {|u| | u G REP} of 
[H] , i.e., the set of tree transformation classes represented by the represen- 
tatives, 

(d) Give an algorithm that, for every string w G computes a repre- 
sentative u G REP such that w u. 

Next we show that if we can carry out the tasks described by (a)-(d) for 
the set H, we have also obtained an algorithm that has the desired property. 

Lemma 3.48. Suppose that the tasks (a)-(d) of the general method have 
been accomplished for H. Then there is an algorithm that determines, given 
two arbitrary tree transformation classes o • • o Um and Vi o • • o , where 
Ui, Vj G H, for every 1 < i < m and I < j < n, which of the conditions 
(i)-(iv) (of the previous subsection) holds. 

Proof. By (a) and (b), 05 = 0 and REP is a set of representatives for 
respectively. Thus, REP is a set of representatives for the congruence classes 
oi0, (Hence, \REP\ = [H], too.) 

The algorithm works as follows. Let us take the tree transformation classes 
i7i o • • -oUm and Vi o -o and form the words , ,.-Um and Vi • . . . • 
respectively. First, by algorithm (d), compute the representatives u and v 
such that 

t^i • . . . • C^m -<=>5 ^ and V"i • . . . • V. 

Then, since <=> 5 = 0, we also have 

Ui o , . ,oUm = W\ and Vi o . . . o = \v\. 

Thus, one of the conditions (i)-(iv) holds for the tree transformation 
classes C/i o • • • o Um and Vi o • o if and only if the corresponding condition 

O’) |w| = ki, 

Oi’) l«i C Ivj, 

(iii’) |v| C |«|, 

(iv’) |«| M |v| 
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holds for \u\ and |v|. However, having the inclusion diagram from (c), we can 
read from the diagram which of the conditions (i’)-(iv’) holds. □ 

We note that the general method only fixes a series of tasks; and the 
application of the method means that the tasks should be implemented for 
an actual monoid. However, it does not guarantee any result for a concrete 
application because during the implementation of a task we may come across 
difficulties that we cannot solve. Still, the method was successfully applied 
for several monoids - see the bibliographic notes at the end of this chapter. 



An application of the general method 

In this subsection we apply the general method to a concrete instance of 
namely, to the set 

H = {TOP, /-TOP, HOM}. 

Having applied the general method to such concrete instances of H, our 
experience has been that it is useful to implement the tasks (a)-(c) of the 
general method by performing the following five steps. 

STEP 1. In this step we produce the string rewrite system S, which is our 
candidate for presenting [H]. Let S consist of eight rules: 



1 ) 


TOP ■ TOP 




TOP 


2 ) 


TOP ■ HOM 




TOP 


3 ) 


HOM ■ TOP 




TOP 


4 ) 


HOM ■ l-TOP 




TOP 


5 ) 


l-TOP ■ l-TOP 




l-TOP 


6 ) 


TOP • l-TOP 




TOP 


7 ) 


l-TOP ■ TOP 




TOP 


8 ) 


HOM ■ HOM 




HOM. 



Since S should be such that 6 candidate S should fulfill this 

condition. However in this step, we can prove only the inclusion 0. The 
converse inclusion will be proved only in STEP 5. 

Lemma 3.49. 0 

Proof. First we show that for every (u v) G S, we have |u| = |t;|, or 
equivalently u0v. In words we say that elements of S are valid in [H]. 

We observe that Theorem 3.39, Theorem 3.45, Corollary 3.41, and Corol- 
lary 3.40 establish that rules 1), 4), 5) and 8) are valid is 5, respectively. 

Next we prove that 2) is also valid. This can easily be done by computing 
as follows. 
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TOP 

C TOP o HOM (by Lemma 2.14) 

C TOP o TOP (by Corollary 3.32) 

= TOP (by Theorem 3.39), 

which establishes that 2) is also valid in 5. The validity of the rules 3), 6 ) 
and 7) can be seen in a similar way, so we omit the detailed proof. 

Now we can prove that 0. In fact, let w,z £ such that w z. 
Then, by the definition of => 5 , there are strings x and y in H* and there is 
a rule (u — ► v) G 5 so that w = xuy and z = xvy. Then we can compute as 
follows: 



w\ 

xuy\ 

x\o |ti|o \y\ 
x\ o \v\ o \y\ 
xvy\ 



(because | | is a homomorphism) 
(because \u\ = \v\) 

(because | | is a homomorphism) 



proving that w0z. Analogously, we can prove that =^ 5 ^C 0, which yields that 
0 also. Finally, we obtain 9* = 9, proving the lemma. □ 



STEP 2. In this step we propose our candidate for the set of representatives 
of the congruence classes of Namely, we conjecture that 

REP = {TOP, HOM, l-TOP, l-TOP • HOM} 

is this set. However, we can prove that only in STEP 5. 



STEP 3. In this step we give the inclusion diagram of the tree transformation 
classes represented by the elements of REP, i.e., of the set \REP\ = {|u| | u G 
REP}. 

In determining the inclusion diagram of \REP\, we follow the method 
described in Sect. 2.2. That is, we conjecture that the inclusion diagram of 
\REP\ is the diagram in Fig. 3.10. Then with the method described in Sect. 
2 . 2 , we determine the sets of formal inclusions and formal inequalities whose 
validity is necessary and sufficient to prove that our conjecture is correct. 
The formal inclusions are: 

HOM C l-TOP o HOM 

l-TOP C l-TOP o HOM 

l-TOP o HOM C TOP. 

The formal inequalities are: 




102 3. Top-Down Tree Transducers 




TOP-UTOPoHOM 0 

UTOP o ROM - ROM 7^ 0 

UTOP o ROM - l-TOP 7^ 0 

^OM - l-TOP ^ 0 

/-TOP - ROM ^ 0, 

moreover, one can easily see that the minimal formal inequalities are: 

TOP - l-TOP o ROM ^ 0 

ROM - l-TOP ^ 0 

l-TOP - ROM ^ 0. 



According to the method, we now prove all inclusions and inequalities 
which will also prove that our conjecture is the inclusion diagram of \REP\. 
The second and third minimal formal inequalities have been proved in Lemma 
3.33 and Corollary 3.36, respectively. Thus it remains to prove the first min- 
imal inequality. 

Lemma 3.50. TOP - l-TOP o ROM 7^ 0. 

Proof. We show that the tree transformation Tchain-to- chains, induced by the 
top-down tree transducer appearing in Example 3.3, is not in l-TOP o ROM. 

We prove by contradiction, that is, we assume that Tchain-to- chains = 
7^1 o 7^2, where T\ = (Qi, i7, 51, Pi) is a linear top-down tree transducer 

and T2 = ({?}, q, R 2 ) is a homomorphism tree transducer. 

We observe a fact which can be easily proved by the reader. Let ^ G 
RRS{Qi^Q,l) such that x\ occurs exactly once in Then there exist a 
context i G Cn,i and a state p G Qi such that ^ = t\p{xi)]. The proof can be 
done by structural induction on 

Thus, since Ti is linear, the following condition holds for the 7-rules in 
Pi. If p(7(a?i)) — ► ^ is a 7-rule in Pi, then either ^ G T/2 or there is a context 
t G Cn,i and a state p' G Qi such that ^ = t\p'{xi)]. 

Now we prove a statement concerning computations carried out by T\. 
For this define 
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ki = max{height{^) is the right-hand side of some rule in R\}. 



Statement 1. Let p EQi and n > 0 be arbitrary. Then there is an integer 

0 < m < n, and there are contexts ti, . . .,^m € and a tree E Tn 
such that 

• xi occurs exactly once in any of , . . . , 

• for every 1 < i < m + 1, heighi{ti) < Ari, and 

• ^i.p(t"(«)) = h[h ■ • [tm+l] • • •]• 

Proof. We prove by induction on n. 

For n = 0, we put m = 0 and ti = r/is(p, a). Then we have 7^i,p(T”(tt)) = 
T^i^p(a) = ti. Moreover, ti G and height{ti) < fci, hence this case is 
proved. 

Now let n > 1 and let ^ = rhs{p,j). (Note that ^ G f?, 1).) 

Then, by the definition of 

Now two cases are possible. 

Case 1: ^ G 

Then we put m = 0 and t\ = Obviously, height{i\) < ki. Then we have 
finished, because, by the note made after Def. 3.22, we have = 

C = h, 

Case 2: The variable x\ occurs exactly once in Then, there is a context 
^1 G Cn^i and a state p' G Qi, such that ^ = ti\p^{xi)] and height{ti) < ki. 
Moreover, 

’T’,.p( 7"(«)) = 'rr,.c(7"“H“)) (by the definition of 7^,. p) 

= ^[p'(®i) rr,,p'(7"“H“))] (by Lemma 3.7) 

= ^i[t7’i,p'( 7"~^(‘^))]- (by the definition of ti) 

Now, by the induction hypothesis on n, there is an integer 0 < m' < n — 

1 and there are contexts G Ca,i and a tree € T^y such 

that 7 ^i,p'(t”'’^(<^)) = ^i[* * *[^m'+i] * * ’]• Moreover, for every 1 < z < m' + 
1, height{t[) < k\. Then, for the integer m = m' + 1 and trees ti,t 2 = 
ti, . . . , tm = C' » tm+i = C/+ 1 , we have 

^i.p(7"(a)) = h[h--- [Wi] • • •]• 

This ends the proof of Statement 1. 

Statement 2. Let n > 0 and s G T/^ be such that T^i(7”(a)) = s. Let 
s' be a subtree of s and assume that s' = <t(si , . . . , s/), for some a £ Q and 
si, . . . ,.s/ G T/ 2 . Then height(si) > k\ can hold for at most one I < i < I- 
Proof. By Statement 1, s = ti[t 2 • • [^m+i] - Jj where the trees ti , . . . , t^+i 
have the properties described there. Then there is an integer 1 < j < m H- 1, 
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such that all but one of the trees si , . . . , s/ should be a subtree of tj . Hence 
the heights of all but one of them should be at most ki . 

Now we can finish the proof of the lemma. For this, define 
k2 = max{height{^) is the right-hand side of some rule in R2} 
and let n > k2{ki -h 1) + 1. 

By our indirect assumption, there exists a tree s 6 T/j, such that 
7^1 (7*^(0)) = s and Tr^is) = This latter equation 

means that g(s) Let us show this derivation in 

detail. There is a place where 6 is derived, hence we can write 

?(s) 9(s') = 5(T“‘(9(«t)),7“^(9(«j))) 

where m > 0, <r € s' = tr(si, . . s^) € T/j is a subtree of s, and 

g(<r(xi,...,a:m)) ^(7“'(«(®.)).7“*(9(®j))) is the (g,<r)-rule in R2. Our 

first observation is that i ^ j. Actually, i = j would imply 7**(/?i) = 
which is impossible because /?i ^ ^ 2 - Then, by Statement 2, one of the trees 
Si and Sj, let us say s,-, satisfies the condition height(si) < k\. Then we have 
h < k\k2- We obtain 

n — 1 = ui -h /i < A?2 "b ^1 * ^2 = ^2(^1 H“ 1)) 

that is, n < ^2(^1 + 1) + 1 contradicting the choice of n. With this we have 
proved that Tchain-to- chains cannot be the composition of a linear top-down 
tree transformation and a homomorphism tree transformation. □ 

We can now prove that our conjecture for the inclusion diagram of \REP\ 
is correct. 

Lemma 3.51. The diagram in Fig. 3.10 is the inclusion diagram of the set 
\REP\ = {\u\\ueREP}. 

Proof. We have estabilshed in Lemma 3.33, Lemma 3.50 and Corollary 3.36 
that HOM-UTOP # 0, TOP-l-TOPoHOM ^ 0 and l-TOP-HOM # 0 
hold. At the same time it is an easy exercise to show that all inclusions shown 
by the diagram also hold. Hence the lemma is proved. □ 

STEP 4 - In this step we prove the following. 

Lemma 3.52. For every w G an element u G REP can be given such 
that w =>5 u. 

Proof. We prove the lemma by induction on length{w). 

Let length{w) = 1, implying that w E H. Then we also have w G REP^ 
by the definition of REP. Hence, for u = u;, we have u G REP and w =>5 u. 
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Now suppose that length{w) > 1, which means that u; = a: • (7, for some 
X G H'^ and U E H. Moreover, by the induction hypothesis on length{w), 
a representative v G REP can effectively be computed, such that x =^5 v. 
Then, regarding v, four main cases are possible and three subcases can appear 
concerning U within each of the four. In each case, we compute u. 

Case 1: v = TOP. Then w = x U =^5 v - U = TOP • U. 

• Subcase U = TOP. Then we have u — TOP, because TOP • U = TOP • 

TOP TOP, by rule 1 ). 

• Subcase U = l-TOP. Then u = TOP, because TOP • U = TOP • /- 

TOP TOP, by rule 6 ). 

• Subcase U — HOM. Then u = TOP, because TOP-U = TOP HOM =>5 
TOP, by rule 2 ). 

Case 2: v = UTOP. Then w = xU^*svU = l-TOP • U. 

• Subcase U — TOP. Then we have u = TOP, because l-TOP • U = l- 
TOP • TOP =>5 TOP, by rule 7). 

• Subcase U = l-TOP. Then we have u = l-TOP, because l-TOP - U = l- 
TOP • l-TOP =>5 l-TOP, by rule 5). 

• Subcase U = HOM. Then we have u = l-TOP HOM, because l-TOP U = 
l-TOP • HOM and there is no rule for l-TOP • HOM. 

Case 3: v = HOM. Then w = x - U =>5 v - U = HOM • U. 

• Subcase U = TOP. Then we have u — TOP, because HOM • U = HOM • 
TOP =>s TOP, by rule 3). 

• Subcase U = l-TOP. Then we have u — TOP, because HOM • U = 
HOM . l-TOP TOP, by rule 4). 

• Subcase U = HOM. Then we have u = HOM, because HOM • U — 
HOM • HOM HOM, by rule 8 ). 

Case 4: v = l-TOP • HOM. Then = x = l-TOP • HOM • U. 

• Subcase U — TOP. Then u = TOP, because l-TOP • HOM • U = l- 
TOP • HOM • TOP =>s l-TOP • TOP, by rule 3), and l-TOP • TOP =>s 
TOP, by rule 7). 

• Subcase U = l-TOP. Then u = TOP, because l-TOP • HOM JJ — l- 
TOP • HOM . l-TOP =^s l-TOP • TOP, by rule 4), and l-TOP • TOP =>s 
TOP, by rule 7). 

• Subcase U = HOM. Then u = l-TOP HOM, because l-TOP HOM U = 
l-TOP • HOM • HOM l-TOP • HOM, by rule 8 ). 

This finishes the proof of our lemma. □ 

STEP 5. In this step we can prove that the tasks (a)-(d) of the general 
method have been performed for our particular choice of H. 
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Lemma 3.53. The tasks (a)-(d) have been performed for 
H = {TOP, /-TOP, HOM). 

Proof. First we prove that = 6. By Lemma 3.49, 6. We now 

show that the converse inclusion also holds. Let ly, E such that wOw' , 
By Lemma 3.52 there are elements it, u' G REP^ such that w u and 
w' =>5 It'. Since 0, we also have 0, and hence w0u and w‘6u‘ , 

Recalling that ^ is a congruence, we obtain it^it'. However, by Lemma 3.51, 
two elements of REP represent the same tree transformation class in [H] 
if and only if they are equal, hence u = u' , Therefore w w* also holds. 
Thus, by [H] = j0 and 0 =-^ 5 , we get [H] = P“*"/ 4 ^ 5 . Hence task (a) 
has been completed for H. 

Next we prove that REP is a set of representatives for the congruence 
classes of Let w be an element of an arbitrary congruence class of ^*g. 
Then, by Lemma 3.52, there is an element u G REP such that w u, 
hence u is in the «^J-class of w. Now assume that the elements u, it' G REP 
are both in the ^*g class of w. Then, by -<=> 5 = 0, we have it^it', hence, by 
Lemma 3.51, we have it = it'. This shows that task (b) and, again by Lemma 
3.51, task (c) have been completed for H. 

Finally, Lemma 3.52 implies that task (d) has been performed. □ 

Now we can prove the main result of this section. 

Theorem 3.54. Let H = {TOP, /-TOP, HOM}. Then there is an algorithm 
that decides, given two arbitrary tree transformation classes U\o- • -oUm and 
Vi o • • • o Vny where //,*, Vj G P, for every 1 < z < m and 1 < i < n, which 
one of the conditions 



(i) 


Uio-- 


■oUm 


= 


V^io. 


■oV„, 


(ii) 


Uio-- 


■oUm 


c 


Vio- 


■oVn, 


(iii) 


Vio. 


••0I4 


c 


Uio. 


■ ■ ® ) 


(iv) 


Uio-. 






Vio- 


•oK, 



holds. 

Proof. By Lemma 3.53, the tasks appearing in the description of the general 
method have been carried out. Hence, by Lemma 3.48 the algorithm exists. 
□ 

An application of the algorithm 

We show by an example how our algorithm works. Let us take the two tree 
transformation classes 



/-TOP o HOM o /-TOP o /-TOP and /-TOP o l-TOP o HOM o HOM 
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which are constructed by composition from the elements of H. 

First, by Lemma 3.48, we have to compute the representatives u and v of 
the words l-TOP • HOM * l-TOP • l-TOP and l-TOP • l-TOP • HOM • HOM, 
respectively. We do this by applying the algorithm described in Lemma 3.52. 

An easy way to apply the algorithm of Lemma 3.52 is to prepare a ta- 
ble from which entries can be readily obtained later. For H = {TOP,l- 
TOP, HOM}, the table looks as follows. 





TOP 


l-TOP 


HOM 


TOP 


TOP 


TOP 


TOP 


l-TOP 


TOP 


l-TOP 


l-TOP 0 HOM 


HOM 


TOP 


TOP 


HOM 


l-TOP 0 HOM 


TOP 


TOP 


l-TOP 0 HOM 



This table, denoted by T, has the following properties. Its rows are labeled 
by elements of REP and its columns are labeled by elements of H. Moreover, 
for V G REP and U E H, the entry T{v, U) at (v, U) is also an element of 
REP such that v • U =>% T{v, U). 

Now, if there is a it; = i/i C /2 • • • • • G H~^, the following small program 
computes u G REP, for which w =>5 u. 

u := Ui] 

for i = 2 to n do 

T{u,Ui) 

By running this program on it; = l-TOP • HOM • l-TOP • l-TOP, we get 
u; = l-TOP • HOM • l-TOP • l-TOP =>s TOP, 



implying that 



l-TOP o HOM o l-TOP o l-TOP = TOP. 

Similarly, running the program on it; = l-TOP • l-TOP • HOM • HOM, we 
get 

l-TOP • l-TOP • HOM • HOM => J l-TOP • HOM, 
which implies that 

l-TOP o l-TOP o HOM o HOM = l-TOP o HOM. 

From the inclusion diagram appearing in Lemma 3.51, we can see that 
l-TOP o HOM C TOP. Hence also 



l-TOP o l-TOP o HOM o HOM C l-TOP o HOM o l-TOP o l-TOP. 
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Presenting [ff] by a terminating and confluent rewrite system 

In this subsection we show that the string rewrite system S which consists 
of the rules 1 ) through 8 ), is terminating and confluent as well (Sect. 2.4). 
Moreover we have chosen REP in such a way that REP = IRR{H, S) holds, 
where we recall that IRR{H, S) denotes the set of irreducible strings over 
H with respect to S (Sect. 2.6). This makes it easier to compute, for every 
w G the representative u G REP with w =>5 u. Then we can more easily 
compute the representative of a string representing a tree transformation 
class. First we prove the two initial statements (in the opposite order) and 
then we discuss their importance. 

Lemma 3.55. REP = IRR{H, S) 

Proof. Recall that REP = {TOP, HOM, UTOP, l-TOP • HOM}, The fact 
that REP C IRR(H, S) can easily be seen as follows. The length of the 
left-hand side of any rule in 5 is 2. Therefore none of them is applicable to 
TOPy HOM and l-TOP, hence these are in IRR{H, S). Moreover, there is 
no rule with a left-hand side l-TOP • HOM in S either, thus l-TOP HOM G 
IRR{H,S) also. 

To prove the converse inclusion, let, for n > 1, 

IRRn = {ly G IRR{H, S) I length{w) = n}. 

We show that IRRn C REP, for every n > 1. 

Obviously IRRi C REP, because IRRi = {TOP,l-TOP,HOM}. Next 
we observe that IRR2 = {l-TOP • HOM} because, as can be seen immedi- 
ately, for any elements U,V e H with U ^ l-TOP and V ^ HOM, U • V 
is the left-hand side of a rule in S, hence it cannot be in IRR2. Therefore 
IRR2 C REP, too. Now we show that IRR3 = 0, which arises from the fol- 
lowing observation. We observe that each element of IRR3, if there are any, 
should be either of the form U • u; or of the form w • U, for some w G IRR2 
and U E H, therefore either of the form U • l-TOP • HOM or of the form 
l-TOP • HOM • [/. However, both U • l-TOP and HOM • U are the left-hand 
sides of a rule in S, whatever U is, therefore neither U • l-TOP • HOM nor l- 
TOP HOM -U can be irreducible. Finally, IRR3 = 0 implies that IRRn = 0, 
for every n > 3, and thus IRRn C REP, for every n > 3. 

This also proves IRR{H, S) C REP. □ 

Lemma 3.56. 5 is terminating and confluent. 

Proof. The fact that S is terminating immediately follows from length{u) > 
length{v), for every rule u v E S. Hence, lengih{w) > length{z) for every 
w,z e H'^ with w z. By Lemma 3.55, IRR{H, S) = REP. Moreover, we 
saw in the proof of Lemma 3.53, that each congruence class of 0^5 contains 
exactly one element of REP and thus exactly one irreducible element with 
respect to => 5 . Hence, by Lemma 2.6, S is confluent as well. □ 
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The importance of Lemmas 3.55 and 3.56 is evident in that when com- 
puting the representative in REP for an arbitrary string w G we do not 
have to use the algorithm described in Lemma 3.52. 

Instead, we can do the following. Given a string w G we compute 
an irreducible element u G IRR{H, S) so that w u, by applying rules 
in S in an arbitrary order as far as we can. The process will stop because 
5 is terminating. On the other hand the same irreducible u will be reached 
in any case, because S is confluent. Since IRR{H^ S) = REP^ we also have 
u G REP, 

For example, take again the two tree transformation classes 
UTOP o HOM o UTOP o UTOP and UTOP o UTOP o HOM o HOM, 
Then we have 





l-TOP • HOM ■ l-TOP ■ l-TOP 




=>5 


l-TOP • TOP • l-TOP 


(by rule 


=^5 


l-TOP • TOP 


(by rule 


=>5 


TOP 


(by rule 


but also 








l-TOP ■ HOM ■ l-TOP ■ l-TOP 






l-TOP ■ HOM • l-TOP 


(by rule 


=>S 


l-TOP TOP 


(by rule 




TOP 


(by rule 



Similarly, we obtain that l-TOP • l-TOP • HOM • HOM UTOP • HOM, 
Since l-TOP o HOM C TOP, the same inclusion relation holds between the 
original classes, too, as we have seen already. 



3.8 Bibliographic Notes 

The concept of a top-down tree transducer was introduced in [Rou69, RouTOb] 
and [Tha70]. Both papers consider top-down tree transducers as formal mod- 
els which are suitable for studying properties of the syntax directed transla- 
tion method. Then fundamental properties of top-down tree transducers were 
more thoroughly investigated in [Eng75], [Bak78], and [Bak79]; special top- 
down tree transducers, and composition and decomposition problems were 
considered mainly. The results of Sect. 3.5 come from these papers. Lemma 

3.9 also follows from Lemma 3.1 of [Hue80], because there are no critical pairs 
arising from the rules of a deterministic top-down tree transducer. 

The top-down tree transducer is enriched with a regular look-ahead ca- 
pacity in [Eng77], which increases its transformational power and yields nice 
closure properties. In the papers [FV89a], [FV89b], and [FV89c] deterministic 
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top-down tree transducers with regular look-ahead were also considered. The 
connection between top-down tree transducers and the theory of L-systems 
was studied in the papers [Eng76] and [ERS80]. In [EngSO] a collection of 
problems, some of them still unsolved even now, was presented. The work 
[EngSl] is a study of the connection between the syntax-directed translation 
method and its formal models, among which top-down tree transducers were 
also considered. In [Eng82a] it was proved that surface languages of top-down 
tree transducers form a hierarchy with respect to composition. Leading on 
from this result, composition of (nondeterministic) top-down tree transfor- 
mations also forms a hierarchy with respect to the number of components 
appearing in the composition. The papers [Eng78] and [Eng82b] also deal 
with top-down tree transducers. A good, though not complete, collection of 
the results appearing in the above papers can be found with a uniform ter- 
minology in the handbook [GS84]. 

A relatively new area in the theory of tree transducers is looking for 
semigroups generated by tree transformation classes in which the inclu- 
sion (and hence the equivalence) of tree transformation classes is decidable. 
Such investigations were started in [FV87b]. Then, in the works [FV87a], 
[FV88a],[FV88b], [FV90], [FV91], and [Fiil91] a method was developed for 
finding an algorithm that can decide the inclusion in certain semigroups of 
tree transformation classes. Also in these papers the method was success- 
fully applied for two semigroups. A summary of these results can be found in 
[FV92b]. Further applications of the method occur in [SV93], [GV96], [DF96], 
and [DF98]. We have also applied this method in Sect. 3.7 of this book. 
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In this chapter we introduce another formal model of syntax-directed seman- 
tics: the macro tree transducer, which is an abstraction from denotational 
semantics. It is more powerful than a top-down tree transducer in the sense 
that it can handle context information (as discussed in Sect. 1.1). In fact, 
macro tree transducers handle the context information in an implicit way, 

i.e., there are parameters given to the states in which context information 
can be assembled and modified. In this way, macro tree transducers can be 
considered as a straightforward extension of top-down tree transducers. 

The structure of this chapter is as follows. 

1. Basic definitions 

2. Induced tree transformation 

3. Characterization of macro tree transformations 

4. Height property 

5. Composition and decomposition results 

6. Bibliographic Notes 



4.1 Basic Definitions 

Since context information is handled implicitly by having additional para- 
meters, we introduce a new set of variables to take over the role of these 
parameters. Let Y = {2/1, 2 / 2 , • • •} he a set of variables disjoint from X (intro- 
duced in Sect. 2.7) and, for every n > 1, let Yn = {2/1, 2/2, • - 2 /n}- In fact, we 
will sometimes call the elements of Y context parameters. 

As in the case of top-down tree transducers, we first define the set of 
right-hand sides, elements of which may be the right-hand sides of rules of 
a macro tree transducer. The new phenomenon with respect to right-hand 
sides of rules of top-down tree transducers is the possibility of nesting states 
in the parameter positions of other states. 

Definition 4.1. Let Q and A be disjoint ranked alphabets and let Ar, n > 0 
be integers. The set RHS{Q,A^k^n) of right-hand sides over Q, A^ k, and 
n is the smallest subset RHS of TQn^(Xk U Yn) satisfying the following 
conditions: 
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(i) For every p G with I > 0, I < i < k, and . . . ,6 ^ RHSy the 

term p(®», , . . . , C/) is in RHS. 

(ii) For every 6 G with / > 0 and , . . . , 6 ^ RHS, the term ^ (^i , . . . , 6) 
is in RHS. 

(iii) Yn C RHS, □ 

In case (i) we say that occurs in the j-th parameter position of p.” 

In the definition of a macro tree transducer, we also allow the initial state 
to have parameters. Since parameters carry context information, it is clear 
intuitively that the concrete values for the parameters of the initial state have 
to be given explicitly. In effect, these values form the environment in which 
the macro tree transducer operates. 

Definition 4.2. A macro tree transducer is a tuple M = (Q, i7. A, qo, ii, E), 
where 

• Q is a ranked alphabet, called the set of states; every state has at least 
rank 1. 

• E and A are ranked alphabets with Q C\ {E U A) = called the input 
alphabet and the output alphabet^ respectively. 

• ^ with r > 0 is a designated state, called the initial state. 

• ii is a finite set of rules such that, for every q G with n > 0 and 

a G E^^^ with fc > 0, there is exactly one rule of the form 

g(<7(xi,...,X]b),2/i,...,1/n) 

in R where a?i, . . ., Xfc G X, yi, . . . , j/n G y, and ^ G RHS{Q, A, k, n), 

• E is Si tuple (ti, . . . , tr) where ^i, . . . ,tr G and r + 1 = rank{qo); E is 
called the environment. 

□ 

By the following note, the formal model of the top-down tree transducer 
can be retrieved in an easy way from the formal model of the macro tree 
transducer. 

Note 4-3. If Q is a unary ranked alphabet, then M is a top-down tree trans- 
ducer. In fact, since the environment is the empty sequence, it can be dropped. 

□ 

The rule q{o‘{xi , . . . , Xk), 2/i, • . . , Vn) — ► ^ is called the (^, cr)-rule of M; 
the variables 2/i , . . . , j/n carry context information for the input subtree 
a{xi ^ . . . , Xk); ^ is called its right-hand side which is also denoted by rhs{q, a). 
Thus the expression ^ = rhs{q^ a) reads is the right-hand side of the (g, a)- 
rule.” Every (g, (r)-rule for some g G Q is also called a a-rule. 

Example 4-4- Let the macro tree transducer Mja/ = (Q, i7, 4, g, ii, E*) be 
defined by 
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. Q = {g(2)} 

• E = A = aW} 

• R consists of two rules: 

1 ) q{x2,q(xi,yi)) 

2) g(a,yi) -^( t(2/i,j/i) 

• E = (a). 

For instance, the first rule of Mbai is called the (g, (r)-rule. Its right-hand 
side, i.e., the term q{x 2 ,q{xiyyi)), is also referred to as rhs(g,cT). In partic- 
ular, the term q{xi,yi) is nested in the parameter position of the outermost 
occurrence of q. 

Later we will see that Mhai takes as input a tree s over E and produces 
as output the fully balanced tree of height k 1 where k is the number of 
leaves of s (see Example 3.29 for the definition of fully balanced trees). In 
the next section we will return to this example and state in a more formal 
way the input-output behavior of Mbah 



4.2 Induced Tree Transformation 

Now, for an arbitrary macro tree transducer M, we will define the tree trans- 
formation which is induced by M. We proceed in the same way as in Chap. 3: 
first, we define the derivation relation of M; second, we prove that the deriva- 
tion relation of M is locally confluent and terminating; and third, roughly 
speaking, we define the transformation of an input tree s as the normal form 
of the term ^o(«, 2/i, • • • , 2/n) (note that this normal form exists because of 
Corollary 2.7). Clearly, in the third step, we have to observe the environment 
of M. 

The following definitions and lemmas concern an arbitrary, but fixed, 
macro tree transducer M = (Q, E, qo, R, E), 

First, let us define the derivation relation of M (see also Fig. 4.1). 
We should point out that the derivation relation is a binary relation over 
^Qui7u^(^) and not just over TqusuAj i»e., the context parameters 2 /, ’s may 
occur in the terms which are derived, and in that situation they are consid- 
ered as nullary symbols. This slight extension is just a technical convenience 
for subsequent proofs. 

Definition 4.5. The derivation relation induced by M is the binary relation 
=>M over the set Tgu^uAiY) such that, for every £ Tqu:suA{y), we 
define (p :=^m V’) if 

• there is a context G Cqusua\jy,i 

• there is a state q £ with n > 0 

• there is a o’ G E^^^ for some k >0 

• there are trees si , . . . , sjb £Tjj 

• there are trees • • • , € TgusuAiY) 
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such that 

= /?[g(tr(si, . . . , s*), ¥>i, . . . , ipn)] and 

= /3K[xi <— Si, . . . , It <— Sk][yi <-<Pi,---,yn V?n]] where 

^ = r/is(g, (t). □ 

Note that <- st][j/i <- <pi,...,yn <- V’n] = <— 

<— Sk,yi *— <Pi,-.-,yn *— y’n]) ie., the sequential substitution 
can also be performed simultaneously. 




Fig. 4.1. A derivation step induced by M 



Example 4-6. Let us consider the macro tree transducer Mbai of Example 4.4 
and show a derivation step. Let 

(p = q{a, q{a{a, a), <r(a, j/i))). 

Then there are a context ^ = g(a, zi), a state q, an input symbol cr 6 
trees si = a, S 2 = and ipi = a{a,yi) such that <p = /?[g(<r(si, S 2 ), ^i)]- 
Then, y> V*. where 
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^ = P[rhs{<l, *-Si,X 2 ^ S 2 ][j /1 y>l]] 

= i{pt,zi)[q{x 2 ,q{xi,yi))[xi <- a ,®2 a][yi ^ <r(a,yi)]] 
= 3(a. ^i)[9(a. ?(«. yi)))] 

= ?(«.?(«> 9(a,«^(a,yi)))) 

We also show a sequence of derivation steps (see Fig. 4.2): 

g(o-(a, (7(a, a)), yi) 
a), q{oc, yi)) 

(by rule (1) with /? = zi) 

9(a, 9(a, ?(«, J/i))) 

(by rule (1) with /3 = z\) 

=>Afiai q(o‘,q(oi,<r(yi,yi))) 

(by rule (2) with ^ = q(a, q{a, zi))) 

=^Mm q{o‘,<r{<r{yi,yi),(T{yi,yi))) 

(by rule (2) with P = g(o', z\)) 

o-(«T((T(yi , yi), (T(yi, yi)), <r(o-(yi , yi), <r(yi , yi))) 

(by rule (1) with /? = zi) 




Fig. 4.2. A sequence of derivation steps of the macro tree transducer Mbai 



Definition 4.7. The set of sentential forms of M ^ denoted by 5F(M), is the 
smallest subset SF of Tqui7u/i(^) for which the following conditions hold. 

(i) For every p G with / > 0, s G Tjj, and . . . , ^/ G SF, the term 

p{s,(pi,...,(pi)eSF, 
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(ii) For every 6 G with / > 0 and , . . . , y?/ G SF, the term 8{(pi , . . . , 
is in SF, 

(iii) Y CSF. □ 

There is a relationship between sentential forms and right-hand sides 
which is similar to the situation for top-down tree transducers (cf. Lemma 
3.7). 

Lemma 4.8. (a) For every right-hand side ^ G RHS(Q^ A^ A:, n) with k^n> 
0, si,...,sjb G Ts, and y?i,...,^n € SF{M), the term ^[xi ^ si,.,,,Xk ^ 
Sk][yi ^ € SF(M). 

(b) For every sentential form (p G 5F(M), there are > 0, n > 0, ^ G 
RHS{Q, A, k, n), and si, . . . , sjb ETs such that <p = ^[xi si, . . ., Xjb ^ Sk]. 

Proof. Statement (a) is proved by structural induction on ^ similar to the 
proof of the corresponding lemma for top-down tree transducers (cf. Lemma 
3.7 (a)). 

Statement (b) is proved by structural induction on (p again similar to the 
proof of Lemma 3.7 (b). In particular, if (p has the form p(s, . . . , y?/) for 
some state p, then the proof uses the same technique of index shift as in part 
(ii) of the proof of Lemma 3.7 (b). □ 

Also the proof of the closure of SF{M) under =>m can be seen as a 
straightforward generalization of the corresponding proof for top-down tree 
transducers (cf. Lemma 3.8). 

Lemma 4.9. SF{M) is closed under =>m- 

Proof. We have to show that, for every <p G SF(M) and G Tgui7u^(^)) 
if ^ =>M then rj) G SF{M), 

The proof is performed by structural induction in the same way as in the 
proof of Lemma 3.8. In particular, if (p has the form p(s, ^i, . . . , y?/), then we 
have to distinguish two cases: first, a rule is applied to the root of and 
second, a rule is applied to one of the subtrees (pi. In the first case, the proof 
is similar to part (i) of the proof of Lemma 3.8; in the second case, the proof 
is similar to part (ii) of the proof of Lemma 3.8. □ 

It follows from the previous lemma that (5 F(M),=^m) is a derivation 
system in the sense of Sect. 2.4. Next we shall prove that the derivation 
relation is locally confluent and terminating. 

Lemma 4.10. The derivation relation is locally confluent on SF{M). 

Proof. Let pi, and p 2 be sentential forms of M such that p P>\ and 
^ ==>M ^ 2 - We show that there exists a sentential form rp G SF(M) such that 
=^Af and p2 

By Def. 4.5, for every i with i = {1, 2}, 
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• there is a context Pi E Cguru^uy,!, 

• there is a state qi G for some n,- > 0, 

• there is a (t,- G for some > 0, 

• there are trees Sii , . . . , , 

• there are trees , ^im G ^gui7u^(^) 

such that 

= ^2[?2(0’2(«21 1 • • • I S2fc3). V>21. • • • , ^2nj)] 

¥>2 = J^2[6[!'1 <-S21,---,a:t3 ^S2*3][yi <-¥’21,..-,yn3 ^¥>2f.3]], 

where, for every i with i = {1, 2}, the term = rhs(^,-, (T,*). 

Let Wi be the occurrence of z\ in Pi, The two cases in which 

1) wi = v )2 and 

2) w\ and W 2 are disjoint paths, i.e., neither is a prefix of the other 

are treated in the same way as in the proof of the corresponding lemma for 
top-down tree transducers (cf. Lemma 3.9) 

Now assume that wi is a proper prefix of W 2 - Then there is a j with 
1 < j < ni such that the term q 2 {(^ 2 {s 2 U • j «2Jb3), ^21, • • • , y^2na) occurs in 

ip\j. Moreover, there is a v such that W 2 = wijv. Assume that yj occurs k 
times in . Let (pij denote the term which is obtained from <pij by replacing 
the subterm at occurrence v by the term ^2[^ci ^ «2i> • • • ? ®a ?3 ^ ^2*3] [j/i ^ 
¥>2i. --.yna <- ¥’2na]- Then define V = ^ su,...,®*, +- su,] 

[yi ¥>11, • . . , yj-i ¥>i(i-i ), Vj ^ <Pij,yj+i ^ <Pm+i)y • • • , yn, ¥’in,]]- 
Clearly, <pi ip and (p 2 =>m ip. O 

Lemma 4.11. The derivation relation is terminating on SF{M). 

Proof. We first prove the following two statements by simultaneous induction 
(see Def. 3.10 and Principle 3.11). 

K: For every s ^ Ts and q G with n > 0, there is a function 

N such that for every y?i, . . .,y?n G SF{M), if the deriva- 
tions starting from (pi are not longer than Ci for every 1 < * < u, 
then the derivations starting from q{s,(pi, . . . ^(pn) are not longer than 
9q,s{ci ) • • • ) Cn). 

L: For every ^ G RHS{Q^A,k,n) with fc,n > 0, and lj = {si,..,,Sk) E 
(Ti;)*, there is a function g^^u; : — ► N such that for every ^i, . . . , E 

SF{M)j if the derivations starting from (pi are not longer than c,- for 
every 1 < * < u, then the derivations starting from ^[xi si, . . . , Xjb ^ 
Sk][yi ^ ^1, • • . , yn ^ y?n] are not longer than g^^w{ci , . . . , c„). 
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Proof of IB: Let ^ G RHS{Q,A,0,n) and let a; = (). The proof is by 
structural induction on 

(i) ^ cannot have the form . . ., 6 )- 

(ii) ^ z= 6 (^ 1 , . . for some 6 G and / > 0. If / > 0, then define 

1 • • • 1 ^n) — ) • • • ) ^n) “h • • • “h j • • • j ^n)j 3 >nd if / — 0 , then 

define g^,w(cu • • • , Cn) = 1- Note that, by induction hypothesis on this struc- 
tural induction, there are functions : N” — N such that the statement 
holds. 

Now let . . . , be arbitrary sentential forms in SF(M). Assume that 
the derivations starting from (pi are not longer than c,*. Then clearly, the 
derivations starting from ^ cannot be longer than . . . , c„). 

(iii) ( = yj for some yj G Yn^ Then with . . . , Cn) defined as Cj, the 

statement follows immediately. 

Proof of ISl: Let s = (t(si, . . . , sjk) E q E and Ar,n > 0. 

Then define . . . , c^) = ci -f . . . + Cn H- 1 + 9rhs(q,a),u,{ciy . • . , Cn) where 

a; = (si,...,sjk). 

An arbitrary derivation starting from q{s, ^i, . . . , ^n) looks either like 
=>M 

and then, if the derivations starting from pi are not longer than c,*, it is not 
longer than ci -f ... + Cn , or it looks like 

=>M rhs{q, a)[xi si, . . . , Xk Sk][yi Pi, . • . ,yn ^ (pn] 

Then, by using the induction hypothesis on L, such a derivation has max- 
imally the length ci -h . . . + c„ + 1 -f 9rhsiq,a),w{ci , . • . , c„). 

Proof of IS2: Let ^ G RHS{Qy A, k, n) and let a; = (si , . . . , Sk). The proof 
is by structural induction where the cases (ii) and (iii) are proved in the same 
way as in the proof of IB. Now let ^ have the form p(x,',^i, . . . ,^/) for some 
state p. Then define 

• • • ) ^n) — 9p,8i(,9^i,tj^(,^l j • • • > ^n)j • • • j > • • • j ^n))* 

By an argument which is similar to the proof of ISl, the statement follows. 
This ends the proof of statements K and L. 

Now let p G SF{M). Then, by Lemma 4.8(b), there is a 
^ G RHS{Q,Ajk,n) and there are si,...,Sjk G Ts such that p = ^ 

si, . . . , xjb ^ sjb]. Then, by L, there is a function : N” — ► N with lj = 
(si,...,Sib) such that the derivations starting from p are not longer than 
9^,uj{0 , . . . , 0). Note that the derivations starting from yj are not longer than 
zero steps. □ 
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Lemma 4.12. The derivation relation is confluent on SF{M). 

Proof. This statement follows immediately from the facts that is locally 
confluent (Lemma 4. 10) and terminating (Lemma 4.11), and from Newman’s 
lemma (Lemma 2.5). □ 

Theorem 4.13. For every ip G 5F(M), the normal form n/(^M» ^) exists. 

Proof. The existence and uniqueness of the term nf{^M , ^) is guaranteed 
by the fact that is confluent and terminating on SF{M) (Lemmas 4.12 
and 4.11) and by Corollary 2.7. □ 

The following lemma will help us in the proof of the next theorem. It states 
that, when computing the normal form of a sentential form q(s, y?i, . . . , ^/), 
then this can be done by computing first the normal forms of the (pi ’s and 
then the normal form of n/(=^Af ? ?(s, u/(^M, V?i), • • • , w/(^Afj ^n)))* 

Lemma 4.14. For every q G n > 0, s G Tjj, and ^i,...,^n G 

5F(M), 

nf{=^M,q{sy , (pn)) = nf{=^M, ^i), • • • , nf{z^M, <pn))) 

Proof. By Theorem 4.13, (pi w/(=^Mj^»)j for every 1 < i < n. Hence, 
g(s, y?i, . . . , ^n) q{s, nf(=>M,Pi), • • • , nf{=>M, (Pn)- Thus our lemma fol- 
lows from Corollary 2.8. □ 



Theorem 4.15. For every p G 5F(M), nf{^j 4 ,p) G Ta{Y). 

Proof. ^ T^(^) is verified by proving the following two state- 

ments by simultaneous induction. 

K: For every s E Tj; and q G with n > 0, the term nf{=^M 

1 q{^i 2/1) • • • > 2/n)) ^ F^(Yn)- 

L: For every ^ G RHS{Qy A, n) with fc, n > 0, and si, . . . , G the 
term ^ Sjk]) G T^(yn). 

Proof of IB: Let ^ G RHS{Q, Zi, 0, n). Then it can be proved by structural 
induction over ^ that n/(=>‘M,0 G T^(yn)* The case in which ^ has the form 
5(^1) ---j^) is proved by a corresponding proof to that for top-down tree 
transducers (cf. Lemma 3.16). Now let ^ = yj for some j with I < j < n. 
Then n/(=>M,2/j) = Vj G TA{Yn). 



Proof of ISl: Let s E Tjj and q E with n > 0. Then there are 

k > Oj a E and si,...,sa; G Tj; such that s = (t(si, . . . , s*). Then, 
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^/(=>M,g(«, 2/1,..., yn)) = n/(=^^M,g(<^(«i,...,5fc),yi,...,yn)) = nf{=>M, 
rhs{q,a)[x\ <— Si,. . .,Xk *— «*]) by Corollary 2.8. By induction hypothesis 
on L, n/(=>-M, rhs{q,(r)[xi ^ si, ...,xjb ^ Sk]) G T^iVn). Hence, nf{=^M 
,9(«,yi,...,yn)) e T^(yn). 



Proof of IS2: Let ^ G RHS{QyA,k,n), Ar,n > 0, and si, . . .,5* G The 
proof is by structural induction on f where we represent [xi ^ si, . . . , xjb <— 
Sk] by [...]. 

(i) Let ^ ... ,6) for some g, x,*, and^i, . . . Then we represent 

by =» and the following holds: 

= ^/(=>, y(«»,6[...], . . . ,6[...])) 

= n/(=^,9(s<,n/(=i^,6[.. .]), . .., w/(==^^,6[...]))) (by Lemma 4.14) 

= w/(=^,«(«*,yi, . . .,y/)[yi ^ 6 [...]), . . .,y/ ^ ^/(=^^,6[.. .])]). 

By induction hypothesis on K, n/(:^M , 9(«», yi, . . . , y/)) G T^(V/). By in- 
duction hypothesis of this structural induction, for every 1 < J < /, »^/(=>m , 
^j[...]) G T^(yn). Hence, n/(=>-M, yi, . . . , y/)[yi ^ n/(=>M, 6 [..•]), ..., 
y/-n/(=>M,6[...])])€T^(yn). 

(ii) and (iii) are proved in the same way as in the proof of IB. 

Finally we can use the same method as in Lemma 4. 11 to finish the proof. 

□ 

Now we can define the tree transformation induced by M similarly to 
Def. 3.17. Again, the superscript der indicates that the tree transformation 
is defined by means of the derivation relation. 

Definition 4.16. Let £* = (ti, . . . , tr) be the environment of M. 

(a) Let q G with n > 0. The tree transformation induced by M with q 

is the function : Ts — ► T^{Yn) defined as follows: for every s £Ts, 
define = nf{=^M,q(s, yi, . . .,y„)). 

(b) The tree transformation induced by M is the function Ts 

defined by . .,yr <r]- □ 



We call a tree transformation a macro tree transformation if it can be in- 
duced by a macro tree transducer. The class of all macro tree transformations 
is denoted by MAC, 

Example The macro tree transducer Mja/ of Example 4.4 computes the 
tree transformation which takes a tree s over with fc leaves 

as input and produces a fully balanced tree of height fc -f- 1 over the same 
ranked alphabet as output. To verify this statement, first observe that, for 
every s with k leaves 
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?(s. yi) g(a,-yg(g, yi 

k k 

This observation can be proved by induction on Ar. Then, by starting from the 
innermost occurrence of g(o, . . .) and using the (g, a)-rule g(o:, j/i) — ► <r{yi , j/i) 
k times, the fully balanced tree of height A: + 1 over is derived. 

After substituting a for y\ according to Def. 4.16, we obtain the fully balanced 
tree of height A: + 1 over □ 



4,3 Characterization of Macro Tree Transformations 

As in Section 3.3, we will give an inductive characterization of for every 
macro tree transducer M. Proofs of properties can then take advantage of 
this characterization. The main difference from the situation of top-down tree 
transducers (cf. Def. 3.22) is the possibility that states may occur nested in 
the right-hand sides of rules. Thus, the following definition will contain a kind 
of second-order substitution (in IS2(i)). Recall that M denotes an arbitrary, 
but fixed, macro tree transducer (Q, i7. A, go, i?, E), 

Definition 4.18. We define the set 



and the set 

Gm = {riSfi ■■ {Tsf ^ Ta{Y) I A: > 0,^ e U RHS{Q, A, k, n)} 

n>0 

of functions by simultaneous induction as follows. 

ISl: For every g G Q, A: > 0, (T G and si, . . . , sjb G Tsy we define 

IS2: For every Ar,n > 0, ^ € RHS{Q, A,k,n), and w = (si, . . . ,5jb) € {TsY , 
we define inductively on the structure of 

(i) If ^ = p{xi,^i, . . .,^i) for some p € with / > 0, 1 < i < 

Ar, and 6 RHS{Q,A,k,n), then Tj|yj(w) = rjSfp{si)[yj *- 

1 < J < fl- 

(ii) If 4 for some 6 € A^'^ with / > 0 and € 

RHSiQ, A, k, n), then = 6{r}Si (a;), .... 

(iii) If ^ = yj , then = yj. □ 

Note that the base IB of the definition by simultaneous induction is part of 
the step IS2 (for fc = 0; cf. Principle 3.20). 
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Example 4-^9, Let us again consider the macro tree transducer Mhai of Ex- 
ample 4.4 and let us compute the value of the function argument 

s = (r(a, cr(a, a)). This computation should be compared with the derivation 
shown in Example 4.6; for ease of reference, we have indicated by an asterisk 
the lines in the following sequence of equalities which correspond to sentential 
forms in the derivation. We abbreviate M^ai by M. 

= ^ “)))]] 

= TjSfgi<r{a, a))[yi ^ yi]] 

= Tlf.^(a)[yi ^ TMfq(0‘)][yi ^ '^M,i(y„yo(0)] 

= ^ ^M,t(y..y,)(0)]bl ^ ‘^(^1,^0] 

=* ^ o-(yi,yi)][yi <- <^{yi,yi)] 

= '^M,t(y,,yo(0)bi ^ <^(yi.yi)]bi ^ <^{yuyi)] 

=* o-(yi,yi)bi ^ <^{yi,yi)][yi ^<^(yi,yi)] 

= o-(o-(o-(yi ,yi), o’(yi . yi )) . (^{(^{yi ,yi), ^ (yi . yi ))) 

□ 

Lemma 4.20. For every ^ G T^(Yn), A: > 0, and u G (Ts)’‘, the equality 

Proof. This statement is proved easily by structural induction pn □ 

In fact, it is also true for macro tree transducers that the functions de- 
fined inductively characterize the tree transformation which has been de- 
fined on the basis of the derivation relation. Clearly, corresponds to 
In order that we also have a corresponding function for with 
^ G RHS{Q^A,k,n), we define the function 'I'AiY) such 

that for every a; = (si, . . . , s*), = n/(=>M ^ 1 < * < *])• 

Theorem 4.21. K: For every s E Tjj and q E with n > 0, the 

equality r^^q{s) = rj^fqis) holds. 
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L: For every fe, n > 0, ^ E RHS(Q^ A, n), and u; = (si, . . . , Sjb) E {Ts)^ , 
the equality Isolds. 

Proof. We prove the two statements by simultaneous induction. 

Proof of IB: Let ^ E RHS{Q^ -4, 0, n) and u; E (7^)° = {()}. Then state- 
ment L is proved by structural induction on 

(i) ^ cannot have the form . . . ,^/). 

(ii) If ^ has the form ^(^i, . . . ,6)> statement follows immediately 

by applying the induction hypothesis of this structural induction. 

(iii) If ^ has the form yj € Y„, then = n/(^M,^[]) = nf(=^M,yj) 

= Vj = 

Proof of ISl: Let s € Ts and q G with n > 0. Then there are 

k >0, cr E and «i , . . . , Sfc G Tr such that s = <r(si , . . . , s*). We repre- 
sent (si , . . . , sjfc) by u. Then, = nf(^M,q{s, yi , • • • , yn)) = nf{=>M 

,?(<r(si,...,st),yi,...,y„)) = nf{=>M,rhs{q,<r)[xi <- si,...,x* <- st]) = 
with ^ = rhs{q,<r). By induction hypothesis on L and by Def. 4.18, 
we obtain = rj^,^(<r(si, . . . , s*)). Hence, (s) = 

Proof of IS2: Let ^ G RHS{Q,A,k,n), k,n>0, and w = (si,...,s*) G 
(Tj;)*. The proof is by structured induction on ^ where we represent [ii <— 
si,...,Xfc «-«*,] by [...]. 

(i) Let $ = p(a:j,^i, for some p, xi, and ^i, . . . Then 

= w/(=i>-Af,p(««, 6 [•••]. • • -.6[ •••])) 

= w/(=>-M,p(sf,n/(=>Af,6[-]),---.«/(=^M,^/[-]))) 

(by Lemma 4.14) 

= n/(=j^M,p(s.-,yi, •,yj))[yi ♦-n/(=^M,6[-]). - 
...,yi*~ n/(=>Mi6[--])] 

= r^!p{si)[yi ^ (w), . . . , yi ^ (w)] 

(by definition of and 

= •TM.p(««)bi ^ TiSi (w)l 

(by induction hypothesis of this structural induction and by 

induction hypothesis on it’) 

= '^lSfp(xi,(i fi)(‘^)' 

(ii) and (iii) are proved in the same way as in the proof of IB. □ 
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Theorem 4.21 states the equivalence of the two approaches to defining the 
tree transformation induced by a macro tree transducer M: one is based on 
the derivation relation of M and the other is based on inductively defined 
functions. This equivalence means that we can omit the superscripts der 
and ind from now on. However, in the proofs of the subsequent lemmas and 
theorems we will use the approach based on inductively defined functions. 



4.4 Height Property 

In this section we show that the height of the output tree of a macro tree 
transducer can be exponentially bounded in the height of the input tree 
(compare this with the situation for top-down tree transducers in Lemma 
3.27). 

Lemma 4.22. Let M = (Q, S, A^qo^ R, E) be a macro tree transducer. 
There is a constant c > 0 such that, for every s G Tj;, the approximation 
height{TM{s)) < holds. 

Proof. Let c be the maximal height of the right-hand sides of rules in R. The 
following two statements can be proved by simultaneous induction. Since the 
proof is straightforward, it is left to the reader. 

K: For every s E Tjj and q E Q, the approximation height {rM,q{s)) < 

^height(s) 

L: For every Ar,n > 0, ^ G RHS{QyA^k,n), and lj G (7^;)*, the ap- 
proximation height{TM,^{uj)) < height{^) • holds where max = 
maa?{0, height{si ), . . .,height{sk)}. □ 

In fact, there is a macro tree transducer which exactly performs the ex- 
ponential growth of the input tree. 

Lemma 4.23. There is a macro tree transducer Mexp = (Q, Ri E) 

with a monadic input alphabet such that, for every s G Tj?, the equality 
.,(«)) = holds. 

Proof. The following macro tree transducer Mexp = (Q> ?o, 

meets the condition on the height of the output trees: 

• Q = {9o^^} 

• S = A = {7^^^ 

. R consists of two rules: 

1) ?o(7(«i),yi) -+9o(a:i,9o(xi,yi)) 

2) go(a,yi) -^T(yi) 

• E= (a). 
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In fact, Mexp is very similar to the macro tree transducer of Example 4.4. 
However, in the first rule, the nested recursive call is applied to the same 
descendant as the outermost call. The height condition of output trees can be 
verified by a simple induction on n and thus the tree transformation induced 
by Mexp is 

□ 

Clearly, the composition of macro tree transducers can perform a tower 
of exponentiations. 

Lemma 4.24. For every r : T^; — ► with r G MAC^ and Ar > 1, there is a 

constant c > 0 such that, for every s G Tj;, the approximation height{r{s)) < 
exp{kj c • height{s)) holds where, for every Ar, n > 0, the number earp(Ar, n) 
is defined by induction on k such that exp{0^n) = n and exp{k -f l,n) = 

2 exp(k,n) 

Proof. Since r is an element of MAC^^ there are macro tree transducers 
Ml , . . . , Mk such that r = tmi o . . . o tm* • Let c,* be the constant associated 
with Mi by Lemma 4.22. By applying Lemma 4.22 repeatedly, we obtain 

h6ighi($) 

.1 

height{r{s)) < cj. 

It is easy to prove by induction on k that there is a constant c > 0 such that, 
for every r > 0, the approximation 

cjj. < exp{kj c • r) 

holds. □ 

An immediate consequence of Lemma 4.23 is that top-down tree trans- 
ducers are less powerful than macro tree transducers. 

Theorem 4.25. TOP C MAC. 

Proof. By Note 4.3, TOP C MAC. Now assume that TOP = MAC holds. 
Then there is a contradiction between the height-lemma for top-down tree 
transducers (Lemma 3.27) and the fact that there is a macro tree transducer 
Mexp which increases the height of the input tree in an exponential way 
(Lemma 4.23). □ 



4.5 Composition and Decomposition Results 

In this section we show that the many-fold composition of the class MAC 
yields a proper hierarchy. Moreover, we characterize MAC as the composition 
of some of its particular subclasses. 
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Theorem 4.26. For every jfe > 1, MAC^ C 

Proof. By composing Ar + 1 times the macro tree transducer Mexp of Lemma 
4.23, we obtain a tree transformation in which cannot be in MAC^ 

because of Lemma 4.24. □ 

Now we introduce two important subclasses of macro tree transducers by 
imposing restrictions on the form of the rules. In these subclasses, the number 
of times that a subtree of the input tree can be referenced, is restricted. 

Definition 4.27. Let M = (Q, i7. A, qo, R, E) be a macro tree transducer. 

1) M is called linear if, for every Ar > 0, cr G 1 < * < and q E Q, 
the variable x,- occurs at most once in rhs(g,(T). 

2) M is called superlinear, if M is linear and for every A: > 0, (t G 

and states q, g' G Q, the set Varx{rhs{qy <r)) fl Varx{rhs{q' , (t)) = 0. □ 

The class of tree transformations induced by linear (and superlinear) 
macro tree transducers is denoted by UMAC (and si- MAC, respectively). 
By definition sl-MAC C l-MAC. 

Note 4.28. l-TOP C l-MAC and sl-TOP C si- MAC. 

Let i7 be a ranked alphabet and let A be such that -4 = {(t' | (t G 
E} U {nil}. Every sequence (ti,...,t„) of i7-trees can be written as 
((t(si , . . . , Sm), ^2) • • ) ^n) if ^1 = ^(«i , • , «m)- Then we define the function 

bin which takes a sequence of trees over E as argument and delivers a tree 
over A as result: 

• For every n > 1, define bin{{<r{si , . . . , s^), ^2, • • , in)) = 

a'{bin{{si , . . . , s^)), 6m((t2 , . . . , t„))) 

• For n = 0, define bin {{ )) = nil. 

Example 4 ^9. Consider the following macro tree transducer M^in = 
(Q, E, A, q, R, E) where 

• Q = 

. <^/(2)^ y(2)^ j^/(2)j y 

• R contains the rules: 

1) i{K^i,X2,xz),yi) 5'(?(a;i,g(a!2,5(a;3,n»7))),yi) 

2) q{<r{xi,X 2 ), yi) (r'{q{xi,q{x 2 , nil)), yi) 

3) ?(7(®i).yi) -^V(9(a;i,m7),yi) 

4) 9(a,yi) a'(m7,yi) 

• E = (nil) 

Clearly, M^in is a superlinear macro tree transducer, because it has just one 
state. 

It can be proved that the tree transformation induced by M is 
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= {(s, 6tn((s))) I s € Te}. 



a 

An important subclass of superlinear macro tree transducers is the class 
of YIELD-functions. Intuitively, they capture the second-order term substi- 
tution which is inherent in the derivation relation of macro tree transducers. 
The second-order term substitution allows the replacement of a symbol at any 
node of a tree by another tree of appropriate rank, whereas the first-order 
term substitution only allows such replacements at leaves. In the definition of 
YIELD the set Z = {^i, ^2, • • .} of substitution variables is used; for r > 0, 
let Zr = {zi,Z2,...,Zr}. 

Definition 4.30. Let E and A be ranked alphabets, r > 0 be an integer, 
g : E^^^ T^{Zr) be a function such that, for every a G E^^"^ and every 1 < 
i ^ contains at most one occurrence of Zj, and let i = (fi, . . . ,fr) € 

The YIELD-function induced by g and f, denoted by yield g is the 
function of type Ts — ► Ta defined by 

yieldgii^s) = yieldg{s)[z\ . .,Zr <- tr]- 

The function yieldg : Ts — ► T^{Zr) is defined inductively on the structure 
of its argument. 

(i) For every a E E^^\ we define yieldg{oc) = g{pt), 

(ii) For every cr E with Ar > 0 and sq, «i, . . . , s* G Tj;, we de- 

fine yieldg{(r{sQ,su..,,Sk)) = yieldg{8Q)[z\ ^ 2/*e/d^ (si ),..., zjb ^ 
yieldg{sk% □ 

The class of all YIELD-functions is denoted by Y I ELD. The restriction 
that g{a) should be linear in Z (i.e., every Z{ occurs at most once in y(a)) 
becomes clear when proving l-TOP o YIELD C l-MAC in Lemma 4.36. 
Also note that, if in case (ii) Ar > r, then the substitution of yieldg(sr^i)^ ..., 
yieldg(sk) for Zr+i> • • respectively, has no effect on yieldg{$o). 

Example 4 - 31 . Consider the ranked alphabets E = A = 

Let r = 2 and define the function 

g : T^{Z2) as follows: g{a) = a{zi,Z2) and g{j 3 ) = 8{zi,a,Z2). 

Moreover, let i = (/?, 7(a)). 

Now consider the input tree s = 6(7(/?), a, (r(a, a)). Then yieldg ^{s) = 
yieldg{s)[zi ^/I,Z2*- y{a)]^ 

yieldg{s) = yield g{j{0))[zi yieldg{a), Z2 ^ yieldg{<r{a, a))] 

= {yieldg {P)[])[zi 4- g{a),Z2 ^ yieldg{a)[zi ^ yieldg{a)]] 
= { 9 {I 3 )[])[zi ^ (t{zi , Z2), Z2 (r{zi , Z2)[zi 4- <r{zi , ^2)]] 

= (6(2ti, a, Z2))[zi 4- a{zi,Z2), Z2 ^ <r{(r{zi, Z2)j Z2)] 

= 6{cr(zuZ2),a,6(zi,a,Z2)) 

Thus yieldg i{s) = 5(cr(^, 7(a)), a, 7(a)). 



□ 
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Indeed, every YIELD-function can be computed by a superlinear macro 
tree transducer. 

Lemma 4.32. YIELD C sUMAC. 

Proof. Let E and A be ranked alphabets. Let g : E^^^ T^{Zr) be a 
function and let i = (ti, . . . ,^r) E (TaY- We construct a superlinear macro 
tree transducer M = (Q, E, A, go, E) such that tm = yieldg j. 

Let Q = be the set of states; hence, go = P- For every fc > 0 and 

<T G E^^\ the set R consists of the rules: 

• If r -f 1 < Ar, then the following rule is in R: 

p((T(a?i,...,arjb),2/i,...,t/r)-^ 

p(a?l, p(x 2 , 2/1 , . . . , 2/r), . . . , p(Xr+l, 2/1 , . . . , 2/r))- 

• Ifl<fc<r+1, then the following rule is in R: 

p(cr(a:i,...,arjk),pi,..., 2 /r) 

p(xi,p(x2, 2/1 , • • • , Pr), . . . ,p(a?jfc, Pi, • . . , Pr ), PAr, . . . , Pr). 

• If fc = 0, then the following rule is in R: 

P(<^,Pl,-**,Pr) -^g{(r)[zi ^Pl,...,2:r ^Pr]. 

• E = ^r). 

We can prove the correctness of the construction by proving the following 
statement by structural induction on s. 

(♦) For every sETs, yieldg{s) = tm,p(5)[pi ^ 2ti, . . . , pr ^ Zr]. 

In the proof we will represent the substitutions [pi 2ri,...,pr Zr] 
and [zi ^yi,...,Zr Pr] by [p ^ z]r and [z ^ p]r , respectively. 

Induction base of (♦): Let s = a E E^^\ Then, 
yieldg{s) = yieldg{a) = g{a) 

= (p(Q^)[^ ^ p]r)[p ^ z]r 

= rhs{p, a)[y ^ z]r (by the definition of R) 

= TM,rhs(p,a){0)[y ^ z]r (by Lemma 4.20) 

= 7'M,p(ct)[p ^ z]r 
= 'TAf,p(s)[p ^ Z]r 

Induction step of (*): Let s = <r{si ^ . . . , s*). 
r + 1 < fe: 



yieldg{(r{si,...,sk)) = 

= yieldg(si)[zi *— yieldg(si+i); 1 < i < ^ — 1] 

= (7-Af,p(si)b ^ Ar)[zi ^ T-M,p(Si+l)[y z]r\ 1 < 1 < * - 1] 
(by induction hypothesis on (*)) 
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= (T-M,p(si)[yj rM,p(S;+l); 1 < J < r])[y z]r 

— ('rHf,rA5(p,<T)((*l> • • • ) ®fc)))[j/ ■' ^]r 

(by definition of R and Def. 4.18) 

= TM,p(<r(si , . . . , Sit))[y z]r. 



1 < A; < r + 1: 



yieldg{(T{si,...,Sk)) = 

= yieldg{si)[zi *— yieldg{si+i)-, 1 < i < A: — 1] 

= (rM,p(si)[y ^ Ar)[zi rM,p(s.+i)[y Ar \ i < i < Ar - i] 

= (rM,pi^i)[yj ^ T-M,p(sj+i); 1 < i < ^ - i])[y ^ ^]r 

= (T-A/,rh»(p,.r)((si, . • • , »t)))b ^ z]r 
= rA/.p(<r(si, . . . , Sifc))[y z]r. 

Thus, for every s € Tu, 

tm{s) = 

= rM,p(s)[yi <- < 1 , • . . , yr ^ <r] 

= (rAf,p(s)[yi ^ Zi,...,yr ^ Zr])[zi *-ti,...,Zr ^tr] 

= yieldg{s)[zi *-ti, . . . ,Zr ^ U] 

= yiMg{{s). 

O 



Example ^.55. Let us reconsider the yie/d-function of Example 4.31. Accord- 
ing to the construction of Lemma 4.32 it can be computed by the following 
macro tree transducer M = {Q, S, A, go, R, E) where 

• Q — and p= qo 

• R contains the rules: 

1) p{6{xi,x2, xs), yi,y2)-*p(xi, p{x2, yi , y2), p(®3, yi , ^ 2 )) 

2) p(<r(xi,a;2),yi,y2) -^P(a:i,p(x2,yi,y2).y2) 

3) p(7(a;i),yi,y2) -*p(a;i,yi,y2) 

4) p{ct,yi,y2)-*(r{yi,y2) 

5) p(Ayi,y2) 5(yi,a,y2) 

• E = {0,j(a)). 

Since the computation of the transformation induced by M on input tree 
s = 5(7(/?), a, <T(a, Of)) is (almost) the same as the computation of yieldg{s) 
in Example 4.31, we do not repeat it here. □ 

One of the main results concerning macro tree transducers is the charac- 
terization of MAC in terms of the composition of TOP and YIELD, i.e., 
MAC = TOP o YIELD (see Theorem 4.37). This means that we have two 
relationships: 
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1. for every macro tree transducer M , there is a top-down tree transducer 
T, a function g, and a sequence t (of appropriate types) such that tm = 
Tr o yield g f and 

2. for every top-down tree transducer T and YIELD-function yieldg i, there 
is a macro tree transducer M such that t>t o yield g { = tm- 

Let us briefly discuss the decomposition result (i.e., relationship 1). 
Roughly speaking, the main additional feature of macro tree transducers with 
respect to top-down tree transducers is the facility to perform second-order 
term substitution, i.e, replace a symbol at any node n of a tree by a complete 
tree t and substitute the subtrees of n at leaves of t, while top-down tree 
transducers can only perform first-order substitution, i.e., replace a symbol 
only at a leaf n of a tree by a tree t. 

From this point of view, the decomposition of a macro tree transducer 
M means that every derivation can be reorganized such that two phases 
appear: During the first phase, merely symbolic replacement is performed 
while postponing the second-order substitution. Then, in the second phase, all 
the second-order substitutions are done together. The first phase is performed 
by a top-down tree transducer T. In the output trees, T inserts additional 
symbols which indicate the way in which the second-order substitution will 
take place. Then, the YIELD-function “reads” these additional symbols and 
carries out the second-order substitution appropriately. 

Lemma 4.34. 

1) MAC C TOP o YIELD 

2) l-MAC C l-TOP o YIELD 

3) sl-MAC C sl-TOP o YIELD. 

Proof. Let M = (Q, L", A, go, LJ) be a macro tree transducer with E = 
(^i,...,tr) and r = rank{qo). Denote maar{n | U ^ 0} by mx. 

We construct a top-down tree transducer T = (Q', i7, T, i?'), a function 

g : ^ T^{Zmx)i and a tuple i G such that tm = vr o yieldg j. 

• Define 0' = {g' | g G Q}; for every q EQ, the state g' has rank 1. 

• Define P to be the ranked alphabet such that P^^^ = ^ ^ U 

{ai, . . . , amx} where the a’s are new symbols, and, for every k such that 

^ 0 or ^ 0, define P^^^ = {cjk} and P^^^ = 0 otherwise. 

• Define the function g : P^^^ — > by g(6') = 6{zi, . . . yZk) if 5 G 

A^^\ and g(a,') = Z{ for every i G {1, . . . , mx}. Note that g fulfills the 
condition that every Zj occurs at most once in g{6'). 

• If mx = r, then i = E, and if r < mx, then f = (^i , . . . , fr, , ^mar)- 

Note that mx < r is not possible, because go G and hence mx > r. 

• For the construction of the rules of T we need the set COMB = 
{COMBk^n\k,n > 0} of functions COMBk^n • RHS{Q,A,k,n) 
RHS{Q^ , Pyk) that are defined inductively on the structure of their ar- 
guments. 
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(i) For every p 6 with / > 0, 1 < i < and 

6 , • ••.€/ G RHS{Q,A,k,n), we define . . . ,^i)) = 

c,+i(p'(ar.-), COMBfc.„( 6 ), • • • , COMBk,„iCi))- 

(ii) For every 6 € with / > 0 eind , . . . , 6 € RHS{Q, A, k,n),v/e define 

COMBk,„mu. . = c,+i{6', COMSi.„( 6 ), • • • , COMBk,„m. 

(iii) For every 1 < J < n, we define COMBic,n{yj) = “j- 
• Define 

R' = {q'{(r{xu. . . , Xk)) ^ COMBk,„iO 1 

(g(<T(a:i,...,a:ife),yi,...,p„) € R}. 

Now assume that M is linear. Since the function COMBk^n does not copy 
variables x,-, T is also linear. Now assume that M is superlinear. Since every 
rule of M is transformed into exactly one rule of T, T is also superlinear. 

The correctness of the construction can be shown by proving the following 
statements by simultaneous induction: 

K: For every n > 0, g G and 5 G Tj;, the relation yieldg{vr,q'{s)) G 

TA{Zfi) holds and the equality TM,q{s) = yieldg{vr,qf{s))[zi ^ 
2 / 1 ,..., Zn ^yn] is true. 

L: For every Ar,n > 0, ^ G RHS{Q,A,k,n), and u G (T 27 )*, the relation 
2 /ieW^(TT,coMBfc,n( 0 (^)) ^ T^iZn) holds and the equality = 

yie/d^(rT,coMBfc,n(«(^))[' 2 ^i ^ 2 /i) • • • » ^ 2 /n] is true. 

The proof is standard and hence it is left to the reader. 

Now we can conclude as follows where we represent the substitutions 
[z\ ^yn], [yi ^h,--,yr and [zi ^ti,...,Zr ^ U] by 

[z y]m [y ^ t]r, and [z i]^ respectively. (Recall that r + 1 = ranfe(go)-) 

tm{s) = 

= TM,qo{s)[y <r-t]r 
= (yieW^(TT,^i(s))[^ ^ y]r)[y t]r 

= yieldg{TT,q»^{s))[z ^ i]r 

= yieldg{Tr^ ffix because 

= yie/d^(7^(5))[2: 

= y^^idgt{vr{s)). □ 



Example \.35, Let us reconsider the macro tree transducer M^ai = 
(Q, E, A, g, R, £■) of Example 4.4, where: 

• <9 = 

• i: = -d = {<T(2 ),aW} 

• R consists of the rules: 

1 ) q{(^{3:i,X2),yi) ^ q(x2,q{xi,yi)) 

2 ) q(a,yi) ^ <r{yi,yi) 

• E = (a). 
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We apply the decomposition of Lemma 4.34 to Mhai- Then mx = 2 and 
we obtain the top-down tree transducer T = (Q', L, J*, g', R) with 

• Q' = 

• r = U U 4^^} 

1 ’) q'{<r{xi,X2)) ^ C2{q'{x2),C2{q'{xi),ai)) 

2’) ?'(a) C3(o-',ai,ai) 

and the function g ; -* T^{Zi) with = <r{zi,Z 2 ), g{Q') = a, and 

g(ai) = zi. 

Moreover, we obtain the tuple i = (a, Z 2 ) because r = 1 < 2 = mx. 

For the input tree s = a{a,a) we show a derivation of Mi,ai and the 
corresponding derivation of T in Fig. 4.3. In fact, the normal form of Mja/’s 
derivation is the image of the normal form of T’s derivation under yieldg . □ 

In general, for every derivation of T there is a corresponding derivation 
of M, and for every call-by- value derivation of M there is corresponding 
derivation of T. A derivation is call-by- value, if in Def. 4.5 the (pi's do not 
contain states. In fact, for the (non call-by- value) derivation 





g(o-(a,a),yi) 


^Mhal 


g(a,y(a,yi)) 


^Mhal 




=>Afbai 


<r(<r(yi,yi),?(a,yi)) 




o-(o-(j/i,yi).o’(yi.yi)) 


of Mbai there 


is no corresponding derivation of T. 



Lemma 4.36. 

1) TOP o YIELD C MAC 

2) l-TOPoYIELD C l-MAC 

3) sUTOPoYIELD C sUMAC, 

Proof. Let T = (Q, JC, T, go, A) be a top-down tree transducer. Let A be 
a ranked alphabet, g : ► T^(^r) be a function such that, for every 

a € g(o) is linear in Z, and i = (ti, . . . ,tr) G {TaY for some r > 0. 
We construct a macro tree transducer M = (Q', i7, id, go, ii', such that 

vr o yieldg t - tm- 

• Define Q* = {q' \ q £ Q} and, for every q £Q, the state g' has rank r -i- 1. 

• Let mx = max{k\L^^^ ^ 0} and Q{Xmx) = {q{^i)\q £ ^ Xmx}- 

Extend g to g' : r^^^UQ(Xmx) TAuQ>{ZrUXmx) by defining g'(g(x»)) = 
q^i^iy ^ 1 , • • • , Zr)- (Clearly, for every a G we have g'(a) = g(a).) 

• ii' is the smallest set of rules such that, if q{(r{xi , . . .,Xjb)) — ► ^ is in ii, 
then the rule 

q'{(T{xi,. ..,Xk),yi,...,yr)^ yieldg, {^)[zi *- yi,. . .,Zr ^ yr] 



is in R\ 
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Fig. 4.3. Comparison of derivations of Mhai and T 



Again it should be clear that, if T is linear (superlinear), then M is also 
linear (superlinear). This is based on the linearity property of g with respect 
to variables Zj . The correctness of the construction is verified by proving the 
following two statements by simultaneous induction. 

K: For every q E Q and s G T^, the equality yield g{vr^q{s))[zi 
yi, . . . , ^ 2/r] = TM,q'{s) holds. 
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L: For every Ar > 0, ^ G iZ£T5(Q, T, A;), and u) G (Ti)*, the equality 
yieldg{vr,i{u))[zi <- yi,...,Zr <- j/r] = lioWs where i' = 

yieldg! (^)[^i <- yi , . . . , Zr ^ J/r] • □ 

Ftom the previous two lemmas we obtain the following theorem which 
characterizes MAC in terms of TOP. 

Theorem 4.37. 1) MAC = TOP o YIELD 

2) l-MAC = l-TOPo YIELD 

3) sl-MAC = sUTOP o YIELD. 

As a corollary we obtain the closure of macro tree transformations with 
left-composition by top-down tree transformations (and also by homomor- 
phisms). 

Corollary 4.38. 1) TOP o MAC = MAC . 

2) HOM o MAC = MAC 

Proof. 

TOP o MAC = TOP o TOP o YIELD (Theorem 4.37) 

= TOP o YIELD (Theorem 3.39) 

= MAC (Theorem 4.37) . 

The proof of the second equality can be derived in the same way by using 
HOM o TOP = TOP (Theorem 3.39 and Lemma 2.14). □ 

Another consequence of the decomposition of macro tree transducers in 
top-down tree transducers and YIELD-functions is an analogy to the decom- 
position result TOP = HOM o sl-TOP (Corollary 3.47) for MAC. 

Corollary 4.39. MAC = HOM o sl-MAC. 

Proof. 

MAC = TOP o YIELD (Theorem 4.37(1)) 

= HOM o sl-TOP o YIELD (Corollary 3.47) 

= HOM o sl-MAC (Theorem 4.37(3)). 

□ 

The composition closure of macro tree transducers from the right with 
top-down tree transducers or homomorphisms cannot be obtained immedi- 
ately as a corollary of results known up to this point. However, in Chaps. 5 
and 6 we will prove results from which these closure results follow. And since 
the following closure result fits here, we include it and assure the reader that 
there is no loop in our proof tree. 

Theorem 4.40. 1) MAC o TOP = MAC 

2) MAC o HOM = MAC. 
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Proof. 

MACoHOM 



MAC o TOP 
HOMoATToTOP 
HOMoATT 
MAC 



(Corollary 3.32) 
(Theorem 6.25) 
(Theorem 5.47) 
(Theorem 6.25). 



□ 
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free tree grammars [RouTOa] (see also [ES78]). 

A formal model called the macro tree transducer was introduced in 
[EngSO]. Independently, primitive recursive program schemes were introduced 
in [CF82]. Actually, such schemes can be considered as many-sorted macro 
tree transducers. Example 4.4 is taken from [EngSO, EV86]. 

In [EngSl] composition, decomposition, and height results were sketched; 
they were investigated more thoroughly in [EV85] for the nondeterministic 
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In this chapter we introduce the third formal model of syntax-directed se- 
mantics: the attributed tree transducer. It is created by abstraction from 
attribute grammars. It can also handle context information like a macro tree 
transducer, and in this sense it too is more powerful than a top-down tree 
transducer. However, attributed tree transducers treat context information 
in an explicit way rather than implicitly as macro tree transducers do. 

The explicit possibility of handling context information gives rise to a 
new phenomenon: the derivation relation of an attributed tree transducer 
can be circular and hence, it is not terminating. Fortunately, this property 
is decidable although at a high price: the decision algorithm has an inherent 
exponential time complexity. But there are large subclasses of the class of non- 
circular attributed tree transducers which can model all problems occurring 
in practice and which have a polynomial time membership problem. For a 
noncircular attributed tree transducer, the derivation relation is terminating. 

The structure of this chapter is as follows. 

1. Basic definitions 

2. Induced tree transformation 

3. Characterization of attributed tree transformation 

4. Height and subtree properties 

5. Composition and decomposition results 

6. Bibliographic notes 

5.1 Basic Definitions 

Before starting the formal investigation, let us recall some features of top- 
down tree transducers. Figure 5.1(a) shows a sentential form of some top- 
down tree transducer T. Clearly, the trees <r(si,S 2 ) and 7 ( 53 ) are subtrees 
of some original input tree s with which the derivation of T began. In Fig. 
5.1(b) the same sentential form is represented in a different way: the input 
subtrees are replaced by their occurrences w and v, respectively, in the. com- 
plete input tree s. Recall from Sect. 2.7 that occurrences are sequences of 
natural numbers. 
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Fig. 5.1. Two representations of the same sentential form 



Now it is clear that, during the derivation of T, these occurrences only 
grow with respect to their lengths until they have reached the leaves of s. 
Then, from a technical point of view, attributed tree transducers are ob- 
tained from top-down tree transducers by also allowing these occurrences of 
s to shrink during the derivation. Thus an arbitrary tree walk can be per- 
formed on s rather than just a recursive descent (as it is done by top-down 
tree transducers and by macro tree transducers). In fact, the possibility of 
programming an arbitrary tree walk on the input tree opens up the possibil- 
ity of a circular derivation. Intuitively, the possibility of letting occurrences 
shrink means that, if a subtree, e.g., cr(si, S2) at occurrence u; of s is consid- 
ered, the computation of a meaning of (t(si , S2) may depend on the context 
of <t(si,S2). 

In attributed tree transducers, meaning names are called synthesized aU 
tributes, because they synthesize the translation of subtrees. Whenever the 
process of increasing the length of an occurrence of s is changed to decreas- 
ing such a length, synthesized attributes call inherited attributes. That is, the 
latter attributes allow information to be passed down (or inherited) informa- 
tion from the context of this subtree. In order to avoid confusion, the set of 
synthesized attributes and that of inherited attributes should be disjoint. 

Now we start the formal details by defining first (as usual) the set of right- 
hand sides of rules of attributed tree transducers. Since, in derivations of an 
attributed tree transducer, occurrences of the input tree may occur and these 
occurrences are manipulated (i.e., they lengthen or shrink), it is reasonable 
to expect a path variable in the rules of an attributed tree transducer. We 
introduce a path variable tt which ranges over the set of occurrences of the 
given input tree. 
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Definition 5 . 1 . Let As and A{ be unary ranked alphabets with ^ 50 ^,* = 0, 
be a ranked alphabet with A fl {As U .4,) = 0, and A: > 0 be an integer. 
The set RHS{As^Ai^A^k) of right-hand sides over As, Ai, A, and A; is the 
smallest subset RHS of TA,uAiU^({^i ^1, satisfying the following 

conditions. 

(i) For every a E As and I <i < k, the term a( 7 ri) is in RHS, 

(ii) For every b E A{, the term 6 ( 7 t) is in RHS. 

(iii) For every 6 E A^^^ with / > 0 and , . . . , G RHS, the term 5 (^ 1 , . . • , 6 ) 

is in RHS. □ 

The symbols tt, ttI, . . ., ttAt should be considered as strings, i.e., as elements 
of ({tt} UN)*, where tt is a symbol. We call tt a path variable because during 
the computation of an attributed tree transducer it will be replaced by actual 
occurrences of the input tree. 

Note that RHS{As , Ai, A, 0 ) = T^(A,({ 7 t})) (for the definition of A,({ 7 r}) 
see Sect. 2.8). 

Definition 5 . 2 . An attributed tree transducer is a tuple A = 
{Att, E, A, ao, R, E) satisfying the following conditions. 

• Att is a unary ranked alphabet, called the set of attributes; Att is parti- 
tioned into the disjoint sets Attsyn and AtUnhi of which the elements are 
called synthesized attributes and inherited attributes, respectively. 

• E and A are ranked alphabets with Att C\ {E U A) = ii, called the input 
alphabet and the output alphabet, respectively. 

• ao G Attsyn is a designated attribute, called the initial attribute. 

• /? is a set Uaei? finite sets Ra of rules; each Ra satisfies the following 
two conditions. 

(1) For every a E Attsyn and <r E 17^*^ with A: > 0, the set Ra contains 
exactly one rule of the form 

a( 7 r) i 

where ^ E RH S{Attsym AtUnh, A:). 

(2) For every 6 E AtUnh^ ^ ^ with A: > 1, and 1 < i < Ar, the set Ra 
contains exactly one rule of the form 

b{m) — > ^ 

where ^ E RH S{Attsyn, AtUnhi A, k). 

• E : Attinh is a mapping, called the environment. □ 

Note 5.3. 1) An attributed tree transducer which does not have inherited at- 
tributes is very close to a top-down tree transducer: the synthesized attributes 
correspond to the states, a rule a( 7 r) ^ in Ra with a E E^^^ and A: > 0 

corresponds to the rule a{a{xi, . . .,Xjb)) where is obtained from ^ 




140 5. Attributed Tree Transducers 



by replacing every a'(7ri) by a' (a?,*) in and the graph of the environment is 
empty. 

2) Vice versa, for every top-down tree transducer it is straightforward 
to construct a corresponding attributed tree transducer with no inherited 
attributes. □ 

For an attributed tree transducer A = {AUyS,A^ao,R^E) we observe 
the following notions and notations: 

• For every a G with Ar > 0, the set of inside attribute occurrences of a, 
denoted by in{a), is the set {a(7r) | a G Attayn} U {b{irj) | 6 G AtUnhy 1 < 
j < k}. The set of outside attribute occurrences of denoted by out{a), 
is the set {6(7 t) | 6 G AtUnh} U {a(7rj) | a G Attayn ^ < j < k}. The set 
of attribute occurrences of a, denoted by att{(r), is the set in{(r) U out{a). 
Note that, for every inside attribute occurrence c{ttt]) G in{(r), there is 
exactly one rule in with c( 7T7/) as a left-hand side. Moreover, attribute 
occurrences appearing in the right-hand sides of rules in Ra are in out{(r), 

• The right-hand side of the rule a(7r) ^ ^ in R^ is represented by 
rhs{a{ir)y(r) and the right-hand side of the rule b{m) ^ in R^ is rep- 
resented by rhs{b{Tri)^a). For every a G we denote by RHS{a) the 
set of right-hand sides of rules in R^, i.e., RHS{(t) = {rhs{c{7rrj),a) \ 
c{irrf) G in{(r)}. 

Example 5.4- Consider the attributed tree transducer 
Aahift = {Att, i7. A, ao, R, E) such that 

• Att = Attayn U Attinh with Attayn = {^0, Ol} and Attinh = {6}, 

• E = A = {#(!), 0,(0)^ 0,(0)}^ 

• E{b) = Q!i, 

• R = Ra,, 

R^ contains the rules: 

1) ao(7T) ^ #(ao(7Tl)) 

2) ai(7r) ^ ai 

3) 6(7t 1) — »■ ai(7rl) 

Ra contains the rules: 

4) ao(7r) — ► ao(7r2) 

5) ai(7r) — ► ai(7rl) 

6) 6(7t1) ai 

7) 6(7t 2) — o’(6(7r),ai(7r2)) 

Rai contains the rules: 

8) ao(7r) 6(7 t) 

9) ai{ir) ai 

and iJcra contains the rules: 

10) ao(7r) — ► 6(7 t) 

11) ai(7r) — ► Qf2- 
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Let us consider the input symbol a. The set m(<r) of inside attribute occur- 
rences of (T is the set {ao(7r), ai(7r), b{irl), 6(7t 2)}, and the set out{<r) of outside 
attribute occurrences of <t is the set {6(7 t), ao(7rl), ai(7rl), ao(7r2), ai(7r2)}. 
As another example for the use of abbreviations, rhs{b{7r2) , cr) = 
(7(6(7r),ai(7r2)) (i.e., the right-hand side of rule 7), and RHS(cr) = 
{ao(5T2), ai(jrl), ai, a(b{w), 
ai(5T2))}. 

In particular, if Ashift takes a right-growing comb s as input, then it 
produces a left-growing comb which has the same frontier as s. For in- 
stance, if s = #({7(ai,(T(ai,(T(a2,Qf2)))), then Ashift produces the output 
tree #(o'((r(<7(ai,ai),a2),a2)). □ 



5.2 Induced Tree Transformation 

In the rest of this chapter, let A = {Att, E, 4, oq, iZ, E) be an arbitrary, but 
fixed, attributed tree transducer. We define the derivation relation of A, the 
notions of circularity and a circularity test, and the definition of the tree 
transformation induced by A, 

Derivation relation 

The derivation relation (Figs. 5.2 and 5.3) is parameterized by an input tree 
s which is considered as a global variable during the whole derivation. In the 
following definition, the occurrences of s are considered as nullary symbols. 

Definition 5.5. Let s £ Tjj, The derivation relation induced by A on s 
is the binary relation over the set Tauuauocc(s) such that for every 

€ Tauuauocc(s) we define <p rp, if 

• there is a context G Cauuauocc(s),i> 

• there is an attribute c G Att, 

• there is a path w G occ(s), 

such that (p = (3[c{w)] and exactly one of the following two conditions holds. 

(1) c G Atisym label{s, w) = cr, a E E^^\ k>0, and 

^ w]]y where ^ = rhs{c{ir), a), 

(2) c G Attinhi u; = vi for some v G occ(s), label{s, v) = a E E^^\ ^ 

1 < i < fe, and ‘ip = *— v]], where ^ = rhs(c(7ri), a). 



□ 
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Fig. 5.2. A derivation step induced by A triggered by a synthesized attribute 



Example 5.6. Let us show two typical steps of an attributed tree transducer. 
For this purpose, consider again the attributed tree transducer Aghift of 
Example 5.4. Let s = #((r(ai, a 2 ))) G Ts be an input tree and let 

^^ = #(ao(122)). 

Then there are a context /? = #( 2 ^ 1 ), an attribute c = ao, and an occurrence 
w = 122 G occ(s) with /a6e/(s, 122) = a 2 , such that (p — P[c{w)\. Moreover, 
(p using rule 10 (Fig. 5.4(a)), where 

^ = #{zi)[rhs{ao{TT), a2)[7r ^ 122]] 

= #(^i)[K^)[^ ^ 122]] 

= #( 6 ( 122 )). 

Another derivation step can be performed on tp. Let /?' = #( 2 : 1 ), c' = 6, 
w — 122, u = 12, i = 2 with label{s^ 12) = <r. Then rj) — /?'[c'(u;')]. Moreover, 
^ 'ising rule 7 (Fig. 5.4(b)), where 
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Fig. 5.3. A derivation step induced by A triggered by an inherited attribute 

= #(^i)[»*/»s(6(jr2), <t)[x 12]] 

= #(«i)K*»W,ai(7r2))[7T 12]] 

= #K6(12),ai(122))). ^ 

Definition 5.7. Let s eTj;, The set of sentential forms of A and s, denoted 
by SF{A, s), is the smallest subset SF of T 4 ttu^uocc( 5 ) for which the following 
conditions hold. 

(i) For every c G Att and w G occ(s), the term c(ii;) G SF, 

(ii) For every 6 G with / > 0 and <pi,, . ,,<pi G SF^ the term S((pi , . . . , 

is in SF, □ 

Now we state a connection between right-hand sides and sentential forms 
which is similar to the corresponding connection for top-down tree transduc- 
ers (cf. Lemma 3.7(a)) and for macro tree transducers (cf. Lemma 4.8(a)). 
Since the proof is very similar to the lemmas mentioned we omit it here. 
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Fig. 5.4. Two particular derivation steps induced by Aahift 



Lemma 5.8. Let ^ G RH S{Att8yn, Attinhj k) with k > 0, s e Ts, and 

w G occ{s) such that for every 1 < i < Ar, G occ(s). Then ^[tt ^ u;] G 

SF{A,s), □ 

A connection between right-hand sides and sentential forms similar to 
Lemma 3.7(b) for top-down tree transducers and Lemma 4.8(b) for macro 
tree transducers cannot be stated here. This is due to the fact that, in the 
right-hand side of a rule of an attributed tree transducer, the attributes 
refer to a node in the neighborhood of the node to which the attribute in 
the left-hand side refers. More precisely, in the right-hand side of a rule 
a{ir) — > the attributes refer to either the same node as attribute a or some 
descendants, and in the right-hand side of a rule 6(7 tz) — > the attributes 

refer to either some sibling of the node to which 6 refers, or its ancestor. 
However, in an arbitrary sentential form, the attributes involved need not 
refer to neighboring nodes of a tree - they can refer to arbitrary nodes. 

We note that, in the study of top-down tree transducers and macro tree 
transducers, the representation of sentential forms as instances of right-hand 
sides (Lemmas 3.7(b) and 4.8(b), respectively) is used in the proof of ter- 
mination of the corresponding derivation relation (Lemmas 3.13 and 4.11, 
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respectively). However, for attributed tree transducers, we will prove the cor- 
responding termination property in a different way (see Lemma 5.24). 

Lemma 5.9. Let s be an input tree of A. Then SF(Ay s) is closed under 

^A,5* 

Proof. Let <p G SF{A^s) and xp G Tauua{occ{s)) such that <p We 

prove that xp G SF{Ay s) by structural induction on <p, 

(i) Let <p = c{xv) for some c G Att and xv G occ(s). 

Case 1: Let c G Attgyn and let label{s, xv) = a. Then xp = rhs{c{ir), (r)[Tt <— 
xv]. By Lemma 5.8, it follows that xp G SF{A, s). 

Case 2: Let c G Attinh and let xv = vi, i > 1, label{s,v) = a. Then 
xp = rhs{c{iri)^ <r)[jr v]. By Lemma 5.8, it follows that xp G SF(A, s). 

(ii) Let <p = for some 6 G I > 0, and 

€ SF{AyS), Since (p V'i there is an i with 1 < 

i < I and a xpi G Tauua{occ{8)) such that (pi =>a,s and xp = 
5(y>i, . . . , pi^i,xpi,pi^i, . . . , pi). By induction hypothesis, the term xpi G 
SF{AyS). Since ^i, . . . , ^,_i, V?*+i, . . . , ^ 5'F(A, s), it follows from Def. 

5.7 (ii) that xp G 5F(A, s). □ 

Circularity and a circularity test 

Clearly, since a derivation can take an arbitrary tree walk on an input tree, 
it is possible that the derivation is cyclic. 

Definition 5.10. 1) A is circular^ if there is an s G Tj;, there is a sen- 
tential form c{xv) with c G Att and xv G occ(s), and there is a context 
G CAuvAuocc{t),i such that c(w) J , P[c{w)]. 

2) A is noncircular, if it is not circular. □ 

Example 5.11, Consider the attributed tree transducer 
A = {Att^ 17, oo, i?, E) such that 

• Att = Attsyn U Attinh with Attsyn = {a} and AtUnh = {&}; uq = a, 

• i7 = ^ = {T('),a(o)}, 

• ii = jRy U iia, where 
iiy contains the rules 

1) a(7r) ^ a(7rl) 

2) 6(7t 1) -+ a(7Tl), 

and iicr contains the rule 

3) a{ir) -* b{ir), 

. E(b) = a. 

Then there is the following infinite derivation on input tree s = 7(a): 
fl(6^) ^a,8 a(l) ^a,8 ^(1) ^a,8 a(l) . . . 



Hence. A is circular. 



n 
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Thus, in order to state that an attributed tree transducer is noncircular, 
we have to check every input tree s for cycles in the derivation relation 
However, there is a way to associate a so-called dependency graph 
with every input tree to A such that A is circular if and only if there is a 
cycle in the dependency graph associated with some input tree. We develop 
this technique. 

First we define the concept of the dependency graph of an input symbol 
a with respect to A, The definition of att{a) is as in Sect. 5.1. 

Definition 5.12. Let a G with fc > 0. The dependency graph of cr 
for A, denoted by is the directed graph (att(<T)^ E{a)) with E{o’) = 

{(c(7rr7), c'(7T77')) G out{(r) x m(cr) | rhs{c\irrj^)y a) contains the subtree c{wrj)} 
as a set of arcs. □ 

We will use the usual graphical representation of the dependency graph 
Dyi((7). It is a trapezoid with the parallel sides drawn horizontally. The sym- 
bol <7 is shown on the upper line. The nodes of Da{(t) are placed on these 
parallel lines such that occurrences of inherited attributes and synthesized 
attributes appear to the left and right of (t, respectively. If the names of 
the attribute occurrences are clear from the context, then they are omit- 
ted. For example, in Fig. 5.5, the dependency graphs 

2 tnd DA,Hifi(ct 2 ) of the attributed tree transducer Aghift of Ex- 
ample 5.4 are shown. 

The local dependencies of input symbols described by the dependency 
graphs extend to global dependencies of attribute occurrences of input trees. 
Next we define the dependency graph of an input tree s for A by induction 
on the structure of s. 

Definition 5.13. Let s GTjj. The dependency graph of s for A, denoted by 
Da{s) is defined as follows: 

(i) For every s = a G Da{s) = Da{(t). 

(ii) Let s = (r{si,...jSk) for some A: > 1, (7 G and Si,...,Sjb G Tjj. 

Assume are already defined for 1 < z < fc and let Att{si) and 

E{si) be the set of nodes and edges of respectively. Now Da{s) 

is the directed graph of which the set Att{s) of nodes and set E{s) of 
edges are defined as follows: 

• Ati{s) = att{(r) U I c(7T7y) G 1 < z < Ar}) 

• E{s) = E(<t) LI {\Ji.i{{c{iriT]),c'{iriT]'))\{c{TrT]),c'{TrT)')) e E{si),l < 
i < A:}). 

□ 

Roughly speaking, the dependency graph of an input tree s can be ob- 
tained by pasting together the dependency graphs of the input symbols ap- 
pearing in s. In Fig. 5.6 we show the dependency graph of the input tree 
#(<r(ai, cr(a2, <r(a2, aa))))- 
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Now let us return to the circularity problem of A. The attribute occur- 
rences appearing in a derivation are determined by the rules applied during 
that derivation. Moreover, the rules induce a local dependency relation among 
the attribute occurrences of input symbols. The local dependencies induce a 
global dependency relation over the attribute occurrences of every input tree 
s in the way described above. Hence there is a close connection between the 
derivation relation =^a,s and the global dependency relation shown by Da(s). 
Namely, any derivation by =^a,s starting from an attribute occurrence (c, w) 
of s will ”call” (or ”use”) another attribute occurrence (c'jw') of s if and 
only if there is a path from (c', w') to (c, w) in Da(s). 

Therefore, in order to check whether A is noncircular, we have to check, for 
every input tree s, whether contains a cycle or not. A naive algorithm 

can certainly not do this. However there is a way to partition the set Tj; into a 
finite number of equivalence classes. Every equivalence class is characterized 
by a so-called is-graph. 

Definition 5.14. An is-graph (over Attinh and Attayn) is a directed graph 
(V, E) such that V = AtUnh U Attayn is the set of nodes and E C AtUnh x 
Attayn is the set of edges. □ 

Thus, in an is-graph, edges may lead only from inherited attributes to syn- 
thesized attributes. Clearly, since Ati is a finite set, there can only be a finite 
number of such is-graphs. More precisely, there are 

is-graphs. In broad terms, a single is-graph is defines the equivalence class 
{s G Tj; I there is an indirect or direct dependency between c{e) and c'(e) if 
and only if there is an edge between c and c' in is}. We will show a lemma 
(Lemma 5.17) which states that it is sufficient to consider only these equiva- 
lence classes in order to test circularity. Let us now formally define the notions 
involved and prove the lemma on which an algorithm can be based for the 
decision about circularity. 

First, for an attributed tree transducer A and an input tree s G T^;, we 
define the is-graph of s (which is an is-graph in the sense of Def. 5.14). Intu- 
itively, the is-graph of s shows the (global) dependencies between occurrences 
of inherited attributes and synthesized attributes at the root of s. In the fol- 
lowing we also require the set of all is-graphs of A. To support the definition 
of the is-graph of an input tree, we define the composition of graphs and their 
is-graphs. 

Definition 5.15. Let a G fc > 0, and Da{(^) = {ati{(r)^E{a)) be the 
dependency graph of a for A. Moreover, for every j with 1 < i < fc, let 
isj = (Aft, Ej) be an is-graph over AtUnh and Attayn- 

1) The composition of Da{<^) and isi , . . .,isjb, denoted by 
D^((r)[isi, . . . , isjb], is the directed graph (att{a), E) such that E = E{(t) U 
{(c(jrj), c'(jTj)) I (c, c') £Ej,l<j< k). 
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2) The is-graph of DA{<r)[isi , . . . , 15*], denoted by is{DA{o‘)[isi ^ . . . , 25*]), 
is the is-graph (AttyE) such that E = {(c, c') £ AiUnh x Attsyn | there is a 
path in DA{o‘)[isi , . . . , 25 *] from c(7t) to c'(7t)}. □ 

We can now define the is-graph of an input tree 5 for A. 

Definition 5.16. Let 5 = cr(5i, . . . , 5*) £Ts> 

1) The is-graph of s for A, denoted by 25 ^( 5 ), is an is-graph defined by 
induction on 5 as follows. 

(i) For every s = a £ define 

isA{s) = is{DA{a)[]), 

(ii) For every 5 = ( 7 ( 51 , . . . , 5 *) G with > 1, define 

isA{s) = is{DA{(T)[isA{si)y . . . , 25a ( s*)]). 

2) Define the set of is-graphs of A, denoted by is-setAy as follows: 25 - 

setA := {25a ( s) \s£Ts} □ 

Recall that is- set a is a finite set. 

Now we can use the notion of the set of is-graphs to construct a circularity 
test for attributed tree transducers that is based on the following lemma. We 
formulate the lemma in such a way that it directly yields the idea of the 
algorithm; that is, every universal quantification is taken over a finite set. In 
particular, the lemma does not refer to the infinite set of input trees (as Def. 
5.10 does). We will call it the circularity lemma. 

Lemma 5.17. A is circular if and only if there is a (t 6 with k >l and 
there are 251 , ... , 2 * 5 * £ is-setA) such that the graph Da(<7)[25i, . . . , 25 *] has 
a cycle. 

Proof. By our argument above, A is circular if and only if there is a tree 
s £Ts such that Da{s) contains a cycle. 

Hence, it is sufficient to prove that there is a tree s £Ts such that Da{s) 
contains a cycle if and only if there is a o’ G with fc > 1 and there are 
25i, . . . , 25* G is-setA such that the graph jDa(<t)[25i, . . . , 25 *] has a cycle. 

First we prove the ‘if’ clause. Assume that the graph Da(<7)[25i, . . ., 25 *] 
has a cycle. Then, by the definition of is-setA , there are trees 5i , . . . , 5 *, such 
that 25 a ( si) = iSjj for every 1 < 2 < fc. Moreover, by the definition of an 
is-graph, for every 1 < 2 < Ar, there is an edge in 25 a ( 5 ,) from an inherited 
to a synthesized attribute, if and only if there is a path in Da(s,) from the 
occurrence of that inherited attribute at the root of 5,- to the corresponding 
occurrence of that synthesized attribute. Hence, for 5 = < 7 ( 51 , . . . , 5 *), the 
graph Da{s) is also cyclic. 

Conversely, assume that there is a tree s £ Ts such that Da(s) con- 
tains a cycle. Let 5 ' be a subtree of 5 such that Da(s') contains a cycle 
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Fig. 6.5. The dependency graphs and 
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Fig. 5.6. The input tree #{< t { oi \,< t { oi 2 ,( t { oi 2 ^ oi 2 )))) with its dependency graphs 



and assume that s' is minimal with respect to its size. Then s' must have 
the form cr(si, . . . , Sk), for some k > 1, a E and si, . . . , sjk ETs Since 
DA{si)y . . . , DA{sk) do not contain a cycle, the cycle in Da(s^) should contain 
some edges of according to Def. 5.13. Then, since, for every 1 < i < fc, 

the edges of isA{si) represent the paths in £>A(«t) from occurrences of inher- 
ited attributes to occurrences of synthesized attributes at the root of S { , the 
graph , *«a( 5A:)] also contains a cycle. □ 

Thus we can design an algorithm (Fig. 5.7) which constructs step by step 
is-setA for an attributed tree transducer A and which simultaneously checks 
the condition of the previous lemma. For this purpose, we use a Modula2-like 
notation for the presentation of the algorithm. We note that, in contrast to 
attribute grammars, there is no distinction between trees of different types, 
that is trees with different nonterminal symbols at the root. Thus, in our 
case it is possible to use the usual set is-setA of is-graphs for A rather than 
a family of sets indexed by the set of nonterminals. 

Clearly, the algorithm terminates, since is-setA is a finite set. The algo- 
rithm discovers every is-graph of A, because for every s = a(s \ , . . . , sjb) G Tj; 
with height{s) = /, the graph 25 a (s) is calculated in the l-th run through 
the repeat- loop out of Da{(t) and 25 a(5i), . . • , 25A(5jb), or it is calculated 
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Input: Attributed tree transducer A = (Att^ U, A, aoiRj B) with 

Att = Attsyn U Attinh and dependency graphs Da{<t) for every 

ae B. 

Output: The answer “A is circular” or “A is noncircular” . 

Method: The algorithm needs the following variables: 

is-setA set of is-graphs of A 
is new constructed is-graph 

D new constructed directed graph 

change boolean information, whether is- set a has changed during 
the last run through the repeat- loop 

is-setA := 0 ; 
repeat 

change := false; 

for every a E ^ begin 

for every Ar-tuple (isi, . . . , isjk) with isj E is-setA do begin 
D := DA{(r)[isi,...,isk]; 
if there is a cycle in D then 
write(“A is circular”); 
stop 

else (* there is no cycle in D *) 
is := is(D); 
if is ^ is-setA then 

is-setA := is-setA U {z>}; 
change := true 
end (* if *) 
end (* if *) 
end (* for *) 
end (* for *) 
until change = false; 
write(“A is noncircular”) 



Fig. 5.7. Circularity test for attributed tree transducers 
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even earlier. Thus we conclude with the following observation concerning the 
correctness of the circularity test. 

Observation 5.18. The algorithm shown in Fig. 5.7 with input A and de- 
pendency graphs Da (o’) for every a E D yields the result 

• “A is circular” , if and only if A is circular, 

• “A is noncircular”, if and only if A is noncircular. □ 

Example 5,19. Consider an attributed tree transducer A with Attinh = 
{ 61 , 62 } and ATTsyn = { 01 , 02 }- Let S = be the input al- 

phabet. The dependency graphs of the input symbols are shown in Fig. 5.8 
(where we have omitted the lower lines of Da((^) and Da(/3))- We now show 
some initial steps of the algorithm in Fig. 5.7 when it is applied to A. 




Fig. 5.8. Dependency graphs of an attributed tree transducer 



is-setA — 0 
change = false 
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isi = (A«,{(6i,ai)}) 
is-setA = {*si} 
change = true 

for /? do 
for 0 ^ 

i«2 = {(62,02)}) 

is-seta = {»«l,»fi2} 
change = true 

for a do 

for (tsijisi) ^ 
iss = {Att, 0 ) 
is-SetA = 
change = true 

for (isi,is2) do 
is4 = (A«,{(6i,a2)}) 
is-setA = {isi,is2,*^3,««4} 
change = frwe 

/or (is2,*«i) do 
is^ = {(62,01)}) 

is-SetA = {i3l,l32,^«3,««4,i«5} 
change = frt/e 

/or (is2,*«2) ^ 
is4 = (Aft, 0) 

change = /a/«e 

/or a do 
for 0 do 

isj = (Aft, {(61, oi)}) = isi G is-setA 

for /? do 
/or 0 ^ 

238 = {Att, {(62,02)}) = is2 G is-setA 

for a do 

/or (2*31, 2Si) ^ 

2*39 = {Atty 0 ) = 2*33 G is- set A 



There are now in total 25 possible tuples (23, 23') with 23, 23' G is-setA- 
No combination yields a new element of is- set At hence change = false, the 
repeat-loop terminates, and the algorithm outputs that A is noncircular. □ 
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Definition of the tree transformation 

In the rest of this chapter, let A = (Attj i7, oq, R, B) be an arbitrary, but 
fixed, noncircular attributed tree transducer. 

Then we can prove that, for every input tree s, is confluent and 

terminating. 

Lemma 5.20. For every input tree s of Ay the derivation relation ^a,s is 
locally confluent on SF{Ay s). 

Proof. The proof of this lemma is very similar to the proof of the same 
property of the derivation relations induced by top-down tree transducers 
(cf. Lemma 3.9), and hence we omit it here. □ 

Now we show that, for every input tree s, the derivation relation =^a ,5 
is terminating on SF{AyS). First we make some observations about infinite 
derivations and we coin some useful notions. 

Definition 5.21. Let s eTs and (p G SF{Ay s). 

1. The sentential form (p is ^-reducible if there is an infinite derivation 

(p = (po =>A,8 ^2 =>A,8 • ‘ • 

2. The sentential form (p is minimal u)-reducible if there is no subtree p' of 

p such that p' is o;-reducible. □ 

Observation 5.22. Every o;-reducible sentential form has a minimal u- 
reducible subtree. □ 

Observation 5.23. Every minimal cj-reducible sentential form has the form 
c{w) for some attribute c and occurrence w. □ 

Recall that we are only dealing with noncircular attributed tree transduc- 
ers. 

Lemma 5.24. For every input tree s of A, the derivation relation =>a ,5 is 
terminating on SF{A, s). 

Proof. We prove this fact by contradiction. Let A be a noncircular attributed 
tree transducer and assume that there is an s G Ts such that =^a,8 is not 
terminating on SF{AyS). Then there is an u;-reducible sentential form p G 
SF{A, s). By Observation 5.22, p has a minimal a;-reducible subtree, say ^o,o 
and, by Observation 5.23, po^o has the form co(tt;o)- Hence, there is an infinite 
derivation 

^ 0,0 =>' A ,5 ^ 0,1 => A ,8 ^ 0,2 • • • 

(see Fig. 5.9 where is abbreviated to =>). Clearly, ^o,i is also an u- 
reducible sentential form and it has a minimal a;-reducible subtree, say ^i^o- 
There is an infinite derivation 
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y?l,0 ^A,s ^1,2 • • • 



where again, (pi^i is cj-reducible and it has a minimal a;-reducible subtree 
V^ 2 ,o- Obviously, this construction of minimal subtrees can be repeated and 
thereby we obtain two infinite sequences (v?»,o)»>o and (v?*,i)»>o of sentential 
forms such that, for every i > 0, the following three properties hold: 

1. (pi^o has the form Ci{wi) for some attribute c,* and some occurrence Wi of 
s, 

2. (pi^o V^*m- 

3. ^i+ 1,0 is a minimal ^-reducible subtree of (pi,ii hence pi^i = /?»[v?*+i,o] 
for some {Att U zi U occ(s), l)-context 




Fig. 5.9. Iterated construction of minimal w-reducible sentential forms 



This means that, for every i, j > 0 with j > i, there is a {Att U AU 
occ{s), l)-context I3ij such that 



<Pi,0 =^X> 
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where . .[/?;-i] . . .]]. Since every has the form Ci(wi) and 

since the set {c(it;) | c G Att, w G occ(s)} is finite, there must be indices io^jo 
with jo > io such that Cfo(ty,„) = cj„(wjJ. Hence, for 

This is a contradiction to the assumption that A is noncircular. □ 



Lemma 5.25, For every input tree s of A, the derivation relation ^a,s is 
confluent on 5F(A, s). 

Proof. This follows from Lemmas 5.20, 5.24, and 2.5. □ 



Lemma 5.26. Let s G Ts and (p G SF{A^s). Then the normal form 
n/(=^A. 5 ,V^) exists. 

Proof. By Lemmas 5.24 and 5.25, is terminating and confluent on 
SF{A^ s)y respectively. Hence, by Corollary 2.7, the normal form n/(^A, 4 , V^) 
exists. □ 

In the following we will prove predicates concerning attributed tree trans- 
ducers using a principle of simultaneous induction which is a slightly modified 
version of Principle 3.11. However, since the modification is very small, we 
will use the same name for the new principle. More precisely, we will define 
the notion “predicates K and L hold” to be that in Def. 3.10 except that the 
sets Bk now depend on an additional parameter. This parameter is a symbol 
a taken from i.e., we consider sets Bk,a with A: > 0 and <r G Since 
k equals the rank of (t, it suffices to show only the parameter a and omit the 
fc; hence, we consider sets Ba with a G for some fc > 0. This additional 
parameter, or better: information, is motivated as follows. In the application 
of the principle of proof by simultaneous induction as defined in Principle 
3.11, the label a (together with some state q) determines a right-hand side 
Then, after having determined ^ once in some proof, it is no longer necessary 
to know that ^ is the right-hand side of some (T-rule. This is due to the fact 
that top-down tree transducers and macro tree transducers work on their 
input trees in a strict, recursive descent manner. In a certain sense they con- 
sume their input trees. However, for attributed tree transducers the situation 
is different because computations may induce an arbitrary tree walk on the 
input tree. To be more precise, consider the local environment of a <7-labeled 
node n and assume that, for some synthesized attribute a, the value of the 
right-hand side ^ = r/is(a(7r), a) is under computation. Then, via some indi- 
rect dependencies, it may occur that the value of some inherited attribute b 
at some i-th descendant of n is needed. Clearly, this value is defined by some 
rule b{Tri) — ► and this rule must be taken from the set R^y i.e., the same 

set from which the rule a{ir) — ► ^ has been taken. This shows that, during 
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the proof of properties in the environment of a (7-labeled node, it is necessary 
to keep the information that the node is labeled by a. 

Definition 5.27. Let E he a, ranked alphabet. Let ^ be a set and, for 
every a E H, let Ka be a predicate over Ts^ i.e., a mapping of type Ts — ► 
{true, false}. Moreover, for every k >0 and a G let be a set and, 
for every b G Ba, let Lb be a predicate over i.e., a mapping of type 

(Ts)^ {true, false}. Let K = {Ka\a e H} a.nd L = {Lb\(r e E,b e B^}. 

1) We say that K holds on Ts, if for every a G H and s G Tjj, the term 
Ka{s) has the value true. 

2) We say that L holds on Ts, if for every k >Q, cr G b G Ba, and 

u> G (Ts)^, the term Lb{oj) has the value true. □ 

Principle 5.28. Let S, H, Ka, Ba, and Lb be given as in Def. 5.27. 

The principle of proof by simultaneous induction is: 

If 

IB: for every a G and b G Ba, Lb{{)) = true and 
ISl: for every 1: > 0, <t G E^^\ si,...,Sk G Tjj, if for every 
b G Bay Lb{{si , . . . , Sk)) = true, then for every a G H, 

. . . , Sk)) = true and 

IS2: for every k>l and s\, ... ,Sk ETs, if for every aG H and every 
i with 1 <i < k, Ka(si) = true, then for every a G E^^\ b G Bay 
Lb{{siy...ySk)) =true, 
then K and L hold on Ts. 



□ 

As in previous principles of simultaneous induction, the abbreviations 
“IB”, “ISl”, and “IS2” mean “induction base”, “induction step 1”, and “in- 
duction step 2” , respectively. 

In the next lemma we prove that the normal form of the sentential form 
a{e) with respect to some input tree s is a tree over the ranked alphabet A 
of output symbols and the set AtUnhii^})- It is clear that an occurrence b{e) 
with 6 G Attinh cannot be derived further, because this would require a node 
above the root of the input tree. 

In order to deal with the situation at a single node of a given input tree, 
we have to impose a linear order on the attribute occurrences. Then there is 
exactly one topological sorting of the set in{a) of inside attribute occurrences 
of a. In the sorting, the condensed dependency graph CDa{s) is taken into 
account. 

Definition 5.29. Let s = <r(si, . . . , sa?) G T^;. 

The condensed dependency graph of s for A, denoted by CDa{s), is a 
directed graph over att{(r) defined by induction on s as follows. 
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(i) For every s = or G 17^°^ define 

CDa{s) = Da (a). 

(ii) For every s = o-(si € Ts with > 1, define 

CDa(s) = DA(<r)[isA(si), isA{sk)]. 



□ 




Fig. 5.10. The condensed dependency graph 



Figure 5.10 shows the condensed dependency graph c^ 2 )) of 

the input tree (t{oc\ ^a 2 ) for the attributed tree transducer Ashift of Example 
5.4. 

Before studying the following definition, the reader is advised to consult 
Sect. 2.2 for the concept of topological sorting. 

Definition 5.30. Let s G Tx? and let < be a linear order on Att. 

The topological sorting of inside attribute occurrences of CDa{s) with 
respect to <, denoted by topsort{A^<^ s)^ is the string over in{a) which is 
obtained by applying the following two steps. 

♦ First, topologically sort the vertices of CDa(s), If there is a choice in 
ordering two attribute occurrences c{irr)) and c'(7rr;'), then sort depth-first 
left-to-right, i.e., if I < rj < t)\ then c{irrj) must occur before c'(7T7;'), and 
if > 1 and r;' = 0, then c{irrj) must occur before c^irr}^). If there is still 
a choice, then obey the linear order <. This yields a unique string w over 
att{(r). 

• Second, drop all the elements of out{a) from w. This yields topsort{A^ <> «)• 

□ 
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Note that every inside attribute occurrence in in{(r) occurs exactly once in 
topsort{A,<,s). Thus RHS{cr) = {rhs{w{u)y(r) 1 1 < < length(w)} where 

w = topsori(A, <, s). 

For instance, if 6 < oq < oi, then the topological sorting of inside at- 
tribute occurrences of CDA,Hifi{<^(o(,a)) in Fig. 5.10 is constructed as fol- 
lows. After the first step we obtain the sequence 6(7 t), 6(7t 1), ao(7rl), ai(7rl), 
ai(7r2), 6(7 t 2), ao(7r2), ao(7r), ai{w). Note that, after having placed b{7r) in 
the sequence, the topological sorting presents a choice between the attribute 
occurrences 6(7 t 1) and ai(7r2). According to depth-first left-to-right, we place 
6(7 t 1) first, followed by ao(7rl) and ai(7rl). After the second step we have: 
6(7 t1), 6(7t 2), ao(7r), ai(7r). 

Note 5.3L Due to the construction, w = topsort(A, <, s) will have the fol- 
lowing property. Assume that s = (t(si, ...,«*) for some ib > 0, (t G 
and si, . . . , sjb G Moreover, assume that a{ir) with a G Attayn (resp. b{irj) 
with 6 G Attinh) is an inside attribute occurrence of tr. Then every derivation 
a{e) ^ (resp. b{j) 0 should be such that ^ may contain an inside 
attribute occurrence of a only if that occurrence precedes a(7r) (resp. b{irj)) 
in w. 

If a(7r) = ti;(l) (resp. b{irj) = ty(l)), i.e., it is the first one in the linear 
order topsort{Ay <, s), then ^ cannot contain any inside attribute occurrence 
of a. 

Lemma 5.32. Let a G Aitsyn and s eTs- Then 
n/(=^i4,« i n(^)) G ^^(Att»n/j({^})). 

Proof. By simultaneous induction (using Principle 5.28), we prove that the 
following predicates K and L hold on Then, from property K the state- 
ment of the lemma follows. 

K\ For every a G Aitsyn and s G Ts, n/(=>A, 5 , o(e)) G TA{Atiinh{{e})). 

L: For every A: > 0 and <r G $ G RHS{cr)y and (si, . . . , s*) G (J^)*, 

n/(=>A,<7(«i,...,5jc)»^[^ ^ ^]) ^ 'I'A{Attinh{{€})). 

To be more precise, the metavariables Hy Ka, Bay and L\, which occur in 
Principle 5.28, are instantiated as follows: 

• ^ = Attsym 

• for every a G Attsym Ka is the predicate over Te such that Ka{s) = true 
if and only if n/(=j^^,,, a(£:)) G T^{Attinh{{e})), 

• K = {Ka I o. G Attsyn} 

and, for every k >0 and a G 
^ Ba = {(t} X RHS{(r)y 
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• for every ^ 6 RHS{(t), is the predicate on such that 

if and only if n/(=>-^,<,(„,...,,^),^[7r e]) G T 4 (>!«,„/.({£})), 

• L = {•£'(<t,{) I O’ G 27, (o’,^) G Bff}. 



Proof of IB: Let tr 6 1^°) and 4 G RHS{<r). Then ^ G r4(^«<„h({7r})) 
and n/(=»A,<T, ^[5T <- e]) = |[7 t ♦- e] G T^iAttinhUe}))- Hence, L(<,.f)(Q) = 
true. 

Proof of ISl: Let k >0, a E and si, . . . , sjb G Ts. Assume that for 
every ^ E RHS{<r)y L(^^^)((si, . . .,Sjb)) = true; clearly this is the induction 
hypothesis on L. Let a G Attgyn^ Since a{e) =^A,a(»i,...,«fc) rhs{a{Tr),a)[w <— 

e], we obtain n/(=>-A,„(„ = «/(=>-A,<r(»i ,*),rAs(a(ir),(r)[5r 

e]). Hence, by induction hypothesis on L, nf(=^A,a(8i,...,8k)i rhs{a{ir)y (t)[tt ^ 
^]) G T^{Attinh{{^})) which proves that Ka{(r{8i, . . .,sjk)) = true. 

Proof of IS2: Let k > 1 and si,...,sjb G T^. Assume that, for every 
a G Attsyn and every I <i < k, n/(=>^,,., a(£)) G TA{Attinh{{e})); clearly 
this is the induction hypothesis on K. Then we have to prove that, for every 
<T G and ^ G RHS{<r), ^]) ^ ^4(i4«i„h({e}))- 

Let < be any linear order on Att; let w denote topsort{A^ <, ( 7 ( 51 , . . . , sjk)). 
Then we prove by (finite) mathematical induction on u the following state- 
ment. 

(♦) For every 1 / with 1 < < length{w), 

nf{^A,a(8^,...,8t,)>rh8(w{u),a)[w ^ e]) G >»/»({£:})). 

Since RHS{cr) = {rh8{w{i/), o’)\l < 1 / < length{w)}^ this also proves L. 

Induction base of (♦) We prove the statement by structural induction on 
rh8(w{l),(r) where we represent su6(rAs(iy(l), (t)) by SUB. 

(i) If d{irj) G SU B with d G Attgyn and 1 < i < A;, then by the assumption 
of IS2, i.e., by induction hypothesis on K, nf{=^>^A,8jid{e)) G TA{Attinh{{€}))^ 
Moreover since 1 / = 1, it follows that nf{=^A,8j,d{e)) G T^, because otherwise 
some b{e) G Attinh{{^}) occurs in this normal form and hence u;(l) is not the 
first symbol of top8ort{A, <, a{8i , . . . , 8k)) which is a contradiction, see Note 
5.31. Since nf{=>A,8j,d{e)) G T^, we also have nf{=^A,a{8u...,8k)yd{irj)[ir ^ 
^]) G which proves the statement. 

(ii) If 6(7 t) eSUB with b G AtUnhy then nf{z^A,a(8i,...,8k)^K^)[^ ^ ^]) = 
b{e) G TA{Attinh{{^}))- 

(iii) If G sub, then 5(^i, ,. .,^i)[?r ^ e]) 

= 6{nf{=>A,a{8u...,8k)ij^i[^ ^ ^D)- Since, 

by induction hypothesis on this structural induction, every nf{::^A,a{8i,...,8k) 
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,^j[7r ♦- e]) € T 4 (^W.nh({e})), where 1 < j < A:, we obtain n/(^A,< 7 (»i .*) 

. ^(6, • • • £]) € r4(^«,„h({e})). 

Induction step of (*) Assume that (♦) holds for every k with I < k < i/. 
We prove the statement by structural induction on rhs{w{i/ + l),<r) where 
we represent sub{rhs{w{i/), a)) by SUB. 

(i) If d{Trj) £ SU B with d £ AU^yn and 1 < i < Ar, then by the assumption 

of IS2, i.e., by induction hypothesis on K, nf{=>A,8jid{e)) £ TA{Attinh{{s})). 
Hence there is an m > 0 and a (^, m)-context /? such that n/(=^A, 5 j 
, d(£r)) = /3[bi{e), . . . , 6m (^)] for some 6i, . . . , 6m G AtUnh- Since (♦) is proved 
by mathematical induction on the positions in topsort{A^ <, a{si , . . .,sjk)), 
for every p with 1 < /i < m, there is an index with 1 < < i/ such 

that w{Kfji) = b^{'irj). Hence, by induction hypothesis on (♦), we know that 

^ ^]) ^ TA{Attinh{{e})i Then the fol- 
lowing statements are true, where we abbreviate to =>«: 

nf{=>,d{irj)[7Ti^ e]) = 

= nf(=>,nf{=fi^sj,d{e))[b{e) ^ b{j)]) 

= n/(=j^, /?[6i(e:), . . . , bm{e)][b{e) ^ b{j)]) 

= n/(=>,^[6i(j),...,6mO*)]) 

= 6i(7Tj)), . . . , n/(=>, 6m(7Tj))] 

= rhs{bi{TTj), ar)[ir 4 - e ]), . . . , n/(=>, rAs( 6 m( 7 rj), (r)[7r ^ e])] 

£ T^{Attinh{{^}))- 

(ii) and (iii) are proved in the same way as in the induction base. 

This finishes the proof of statement (♦) and of IS2. Since IB, ISl, and IS2 
are proved, this shows that K and L hold onTs. □ 

We can now define the tree transformation induced by A. 

Definition 5.33. Let E be the environment of A. 

(a) Let a be a synthesized attribute. The tree transformation induced by A 
with a is the mapping : Ts — ► TA{Attinh{{e})) which is defined, for 
every s G Ts, by rj"(s) = n/(=^,i,,,a(e)). 

(b) The tree transformation induced by A is the mapping r*’’ : Ts —* 

Ta such that, for every s S Ts, Ta*''(s) = where E : 

T^{Attinh({£})) —* T^ is defined by structural induction as follows. 

(i) For every b € AtUnh, E{b(e)) = E{b). 

(ii) For every S 6 with / > 0 and (pi,...,<pi G •„;,({£})), 

E{S{<pi,...,<pi)) = 6{E{(pi),...,E{(p,)). □ 
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We call a tree transformation r an attributed tree transformation^ if r can 
be induced by some attributed tree transducer. The class of attributed tree 
transformations is denoted by ATT, 

Example 5.34- Consider again the attributed tree transducer Ashift in Ex- 
ample 5.4. It can be proved by induction on n that, for every sequence 
where 1 < ii,...,in < 2 

. • • • . «.») • • •))) = 

#(<t(<t(. . a, •„)). 

Roughly speaking, takes “right-growing” combs of E trees into “left- 

growing” combs in such a way that it does not change the order of the leaves. 

□ 



5.3 Characterization of Attributed Tree Transformations 

As in Sects. 3.3 and 4.3 for top-down tree transducers and macro tree trans- 
ducers, respectively, we provide here an inductive characterization of at- 
tributed tree transformations. We also have to modify slightly the principle 
of definition by simultaneous induction (Principle 3.20) so that it is suited 
to the context of attributed tree transducers (see the discussion after Lemma 
5.26). However, as for the principle of proof by simultaneous induction, we 
use the same nomenclature as in Principle 3.20. 

Definition 5.35. Let i7 be a ranked alphabet, let H and C be sets, and for 
every (t G X*, let Ba be a set. Moreover, for every a E let fa :Tjj ^ C he 
a function and, for every Ar > 0, (r G and b £ Ba, let gi, : (Ts)^ C he 
a function. 

Let 

F = {fa:Ts^C\aeH} 

and let 

G = {gi, : (Tr)‘ ^ Cjib > 0,<r € G Ba}. 

We say that F is defined if, for every a E H and s E TjSi fa{s) is defined. 
Similarly, we say that G is defined if, for every A: > 0, G b E Ba, and 
si,.,,,Sk ETj!, 9b{{si,.,.,8k)) is defined. □ 

Principle 5.36. Let F and G be two families of functions as in Def. 5.35. 
The principle of definition by simultaneous induction is: 

If 

IB: for every a E and b E Ba, the value fl^6(()) is defined, and 
ISl: for every Ar > 0, <r G E^^\ «i, • . ,sjb G Tjj, and a G if for 

every b E Ba, the value gb{{si, . . . , Sjb)) is defined, then the value 
/a(<^(«i, • • • j «*)) is defined, and 
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IS2: for every A: > 1, o' E «i, • • and h E if for 

every a £ H and 1 < i < A;, the value fa{si) is defined, then the 
value , Sk)) is defined, 

then F and G are defined. 



□ 

We use this principle of simultaneous induction to define sets of map- 
pings which characterize the derivation relation of attributed tree transduc- 
ers. Recall that A denotes an arbitrary, but fixed, attributed tree transducer 
(Aft, oq) F) ^)* 

Definition 5.37. Let < be a linear order on Att. We define the set 
F = {r<,A,a • F^{Attinfi{{€})) | d E Attsyn} 
of functions and the set 

G<,a = : (Tr)* ^ T^{AtU„H{{e})) | fc > 0, (t € G RHS{(t)} 

of functions by simultaneous induction according to Principle 5.36 where we 
have instantiated the sets as follows: 

• H — Attgyfi 

• = {o'} X RHS{(t) for every a £ E. 

IB: For every a £ E^^^ and ^ E RHS{<r), 

ISl: For every A: > 0, o' E E^^\ and si, . . ., s* £ Tsy and a £ Attgyn, we 
define 



IS2: Let Ar > 0, o' E E^^\ and si, . . . , Sjk E T^;. Let w = topsort{A^ <, s) 
with s = o'(si, . . . , Sjb). By (finite) mathematical induction on i/ with I <i^ < 
length{w), we define where u = (si, . 

Induction base i/ = 1: The value T*^^A,( 7 ,rhs{w(i),a)(^) defined by struc- 
tural induction on rhs{w{l),a). Let SUB represent sub{rhs{w{l) ^ a)) . 



(i) Let d{m) £ SUB for some d £ Attgyn and 1 < 2 < A?. Since 2 ^ = 1, the 
value G T^; in particular, this value does not contain inherited 



attributes. Thus we can define a 



-ind 



•<,A,di^i)' 

(ii) Let 6(7 t) £ SUB for some 6 E AtUnh- Then define ^ “ ^(^)* 

(iii) Let 6(^i, ... ,6) ^ SUB for some 6 £ ^ > 0, ^ 1 , . . . ,^/ E SUB. Then 

define ’ • ’ > 
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Induction step i/ — » + 1: Assume that for every k with I < k. < u, 

the value < 7 rh 5 (u;(K) already defined. Then we define the value 

^<rl r/»a(t/;(i/+i) < 7 )(^) Structural induction on rhs{w{i/ + Let SUB 

represent sub{rhs{w{i/ + 1), cr)). 



(i) Let d{Tri) G SUB for some d G Attsyn and 1 < i < k. By the application 
of ISl, the value d(^») is already defined. Then there are an m > 
0, a (^i, m)-context /? and 6i,...,6y„ G AiUnh such that = 

/3[bi{e ), . . . , 6m(^)]« According to Note 5.31, for every fi with I < fi <m 
there is an index /c^ with I < such that w(k^) = 6^(7ri). Hence, 

by induction hypothesis, a rhs{b^(iri) already defined. Thus 



we define 
m]. 



^ind 
<,A,a,rhs(w(Kp 






;i </^< 



(ii) and (iii) are the same as in the proof of the induction base. 



□ 



Note that IB is part of IS2 for i = 0. 

Intuitively, it is clear that the functions depend 

on the order <. Instead, < only infiuences the way in which the functions are 
defined. More precisely, if for some input symbol a, there are two attribute 
occurrences and c'{irri) which do not depend on the choice of the order 

<1 or <2 with c <1 c' and c' <2 c, respectively. This is stated explicitly in 
the following lemma. 



Lemma 5.38. Let <1 and <2 be two linear orders on the set of attributes 
of A. Then 

. for every a G Att,yn, = ^</,A,a 

. for every k>0,<re r(‘), ^ G RHS{<t), . □ 

Thus, in the following, we will drop the linear order on the set of at- 
tributes from the denotation of the r*” ^-functions. Moreover, we show that 
the tree transformation induced by an attributed tree transducer A 
coincides with the inductively defined function To prove this equiva- 
lence, we introduce, for every a G with k > 0 and ^ G RHS{a), the 
function : (Tj;)* TA{AtUnh{{e}) such that . . . , Sjk)) = 

^ ^])* 

Theorem 5.39. K: For every a G Attsyn and s G Ts, 

L: For every A: > 0, cr G ^ G RHS{a), and u; = (si, . . .,sjb) G (Tj;)*, 

Proof. The proof of these statements follows the principle of simultaneous 
induction in Principle 5.28. Actually, the structure of this proof is the same 
as the structure of the proof of Lemma 5.32 and thus we omit it here. □ 




5.4 Height and Subtree Properties 165 



As in the case of top-down tree transducers, we will now drop the super- 
scripts ind and der from the denotations of the and the ^-functions. 



5.4 Height and Subtree Properties 

In this section we estimate both the height of an output tree and the number 
of subtrees of an output tree with respect to the size of the corresponding 
input tree. 

Lemma 5.40. There is a constant c > 0 such that, for every s £ Ts, 
height{rA{s)) < e size{s). 

Proof. Consider an input tree s E Ts and the set Att{s) of attribute in- 
stances of s, i.e., Ait{s) = {c(u;)|c G Att,w G occ(s)}. Since A is noncir- 
cular, the set Ati{s) can be sorted topologically such that, if c{w) depends 
(directly or indirectly) on c'(u;'), then c'(u;') occurs before c{w) in this linear 
order. Let sort = ci(u;i), C 2 (u; 2 ), • • , Cn{wn) be such a linear order. Note that 
n = card{Att) • size{s), where card{Att) is the number of attributes. 

Now it is easy to observe that the highest output tree is achieved if the 
right-hand sides of the defining rules of the attribute instances in sort are 
put on top of each other. 

Then we can prove the following statement by mathematical induction. 

For every i with 1 < i < n, height{nf{=^A,8yCi{wi))) < i • 
max{height{RH S)), 

where max{height{RHS)) denotes the number max{height(^)\^ G 
RHS{cr),aeS}. 

i — 1: Since ci(u;i) does not depend on any other attribute, the statement 
follows immediately. 

z 2 -f 1: We have to distinguish two cases: either c»+i is a synthesized 
attribute or Cj+i is an inherited attribute. 

Ci^i G Attgyn : Let label(s^Wi^i) = a and let ^ = rhs{ci^i{n)ya). Then 
there is an m > 0 and a {A, m)-context and there are attribute occurrences 
di(7Tf7i),...,dm(’r»7m) € att{a) such that ^ = j5[di(ir»7i), . . By 

the definition of sort, every dj{wi^ir)j) occurs before c,+i(u;,+i) and hence, 
we can assume that the statement is true for these occurrences. Then we can 
compute as follows: 

height{nf{=!>-A,s , c,+i(iu,+i))) 

= height[nf{^A ,$ , /?[di(«'<+i»?i), • • • , rfm(i«.+i»7m)])) 

< height{0) + max{height{nf{^A,> , dj{wi+iT)j))) 1 1 < J < m} 

< max{height{RH S)) + max{i • max{heighi{RH S)) 1 1 < J < ni} 

= (i + 1) • max(height(RH S)). 
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Cj+i G Aitinh • If ^^»+i = then the statement is true trivially. Now 
let Wi^i = vl for some v G occ{s) and / > 1. Let label(s,v) = <t and 
let ^ = rhs(c,.j.i(7r/), O'). Then there is an m > 0 and a (^, m)-context /?, 
and there are attribute occurrences di(7T7yi), . . . , dm(7r?7m) ^ aU{(r) such that 
^ = P[di{Tr'qi )^ . . . , By the definition of sort, every dj{vrjj) occurs 

before and hence, we can assume that the statement is true for 

these occurrences. Then we can compute as follows: 

heighi{nf{=^i^A,3 , Ci+i(u;,>i))) 

= height{nf{=>A,s,/3[di{v7]i), . . . , dm{vr)m)])) 

< height{l3) + max{height{nf{=i>^A,sidj{vrjj))) 1 1 < j < m} 

< max{height{RH S)) + max{i • max(height{RH S)) 1 1 < i < m} 

= (i + 1) • max{height{RH S)). 

This proves the statement. By choosing i = n and realizing that n = 
card(Att) • size{s), we obtain the following: 

heighi{rA{s)) 

= height{nf(=^A,s , ao{e))[b{e) ^ E{b); b G AtUnh]) 

< height{nf{=>A,s > c^o(^))) + max {height (E{b)) | b G AtUnh} 

< card(Att) • size{s) • max{height{RH S))-h 
max {height (E(b)) 1 6 G AtUnh} 

< c • size{s) 

where c = card(Att) • max(height{RH S)) 4- max {height (E{b)) \ b G AtUnh- 
This proves the statement of the lemma. □ 



Corollary 5.41. There is a constant d > 0 such that, for every s £ Tjj, 
size{TA{s)) 

Proof. By Lemma 2.13, there is a constant e > 0 such that size(t) < 
^hetght(t) every t E Then, by Lemma 5.40, we obtain size{rA{s)) < 
^hetght(TA(8)) ^ ^c-stze(s) ^jjere c is the constant of Lemma 5.40. Then it is 
easy to see that there is a constant d such that size{TA{s)) < □ 



Lemma 5.42. For every r G ATT^ with k > 1 there is a constant c > 0 
such that, for every s G the approximation size{r{s)) < exp{k, c-size{s)) 
holds where, for every fc, n > 0, the number exp{k, n) is defined by induction 
on k such that exp{0, n) = n and exp{k + 1, n) = 2®^^^*’”). 

Proof. The proof is completely analogous to the proof of Lemma 4.24. □ 
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In a similar way we can estimate the number of different subtrees of an 
output tree. 

Lemma 5.43. There is a constant c > 0 such that for every s G Tj;, 
card(sub{TA{s))) < c^size{s). 

Proof. Consider an input tree s E Tjj and again the set Att{s) of at- 
tribute instances of s, i.e., Att(s) = {c(u;) | c G Att^w G occ(s)}. Let 
sort = ci(wi),C 2 {w 2 )i . • . iCn{wn) be the linear order of the attribute in- 
stances as in the proof of Lemma 5.40. Again let n = card{Att) • size{s)^ 
where card(Att) is the number of attributes. 

Now we prove the following statement by mathematical induction on i 
where max{size{RH S)) denotes the number max{size(^) G RHS{<t),(t G 
E}. Moreover, we represent by n/(^). 

For every i with 1 < * < n, card([J^^j^- sub{nf(cj{wj)))) < i • 

max{size{RH S)). 

1 = 1: The statement follows immediately. 

2 — » 2 -f 1: Again we have to distinguish two cases: either is a synthe- 
sized attribute or c,+i is an inherited attribute. 

Ci^i G Attsyn • Let label(s^Wi^i) = <r and let ^ = r/is(c,+i(7r), cr). Then 
there is an m > 0 and a {A^ m)-context /?, and there are attribute occurrences 
diiiTTfi ), . . . , dmiTrrjm) G att{a) such that ^ = /?[di(7T7;i), . . . , dm{Tn]m)]- We 
note that, due to the definition of sort, every dj{wi^i7}j) occurs before 
c*+i(u;i+i). Then we can compute as follows: 

ca»-d(Ui<i<i+i sub{nf{cj{wj))) ) 

= card(sub(nf(ci+i(wi+i))) U Ui<;<,- ) 

= card(sub(<p) U Ui<j<< sub(nf{cj (wj))) ) 

where (p = 0[nf{di{wi+iT]i)), n/(dm(«'i+i>?m))] 

= card{(sub{ip) — M) U M U Ui<j<« sub{nf{cj{wj))) ) 

where M = Ui</<m since M C sub{<p) 

= card{{sub(<p) - M) U Ui<i<i sub(nf{cj {wj ))) ) 
since every dj{wi^i7]i) occurs before c,-f.i(u;i+i), 

< card{sub{(p) — M) + card{\J^^-^^ sub(nf{cj{wj)))) 

< card{sub{/3)) + card{\J^^j^. sub{nf{cj{wj)))) 

< max(size{RH S)) + i • max{size{RH S)) 

= {i + 1) • max{size{RH S)). 

Cj+i G Attinh • This case is proved in a similar way to the first case. 
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This proves the statement. By taking i = n and again realizing that 
n = card{Att) • size{s)^ we obtain 

card{sub{rA{s))) 

— card{sub{nf{ao{e))[b{e) ^ E{b); b G AtUnh])) 

< card(Ui<^.<„ sub{nf{cj {wj))) + EbeAttt„, size(E(b)) 

< card(Att) • size(s) • max(szze(RffS)) + J2bsAttinh 

< c • size(s) 

where c = card(Att) • max{size{RH S)) + ^*^e(£'(6)). This proves 

the statement of the lemma. □ 

An immediate implication of the lemma above is that attributed tree 
transducers are more powerful than top-down tree transducers. 

Theorem 5.44. TOP C ATT. 

Proof. From Note 5.3 it is clear that TOP C ATT. Now consider the fol- 
lowing attributed tree transducer A = {Att^ i7, oq, iJ, E) with 

• Att = Attsyn U Attinh with Attsyn = {ao} and AtUnh = {b} 

• R = RffU Ra where 
Rfj contains the rules 

1) ao(7r) cr(ao(7r2),ao(7r2)) 

2) 6(7t1) — > b{ir) 

3) 6(7t 2) — ► ao(^l), 

and Ra contains the rule 

4) ao(7r) <T(6(7r),6(7r)), 

• E{b) = a. 

It is easy to observe that 



TA 2 {(«n,S 2 -)|n > 1} 

where Sn is the fully balanced tree over E defined in Example 3.29. 

Now assume that ta G TOP. Then, by Lemma 3.27, the height of an 
output tree of ta is linearly bounded by the height of the corresponding 
input tree. However, this contradicts our observation and hence, ta i TOP. 

□ 



5.5 Composition and Decomposition Results 

As in the case of macro tree transducers, attributed tree transducers also 
form a strict composition hierarchy. 
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Theorem 5.45. ATT^ C for every fc > 1. 

Proof. By composing (A: + l)-times the attributed tree transducer of the 
proof of Theorem 5.44, we obtain a tree transformation in ATT^~^^ which 
cannot be in ATT^, by Lemma 5.42. □ 

Finally we want to show that ATT is closed under right composition 
with TOPy i.e., ATT oTOP C ATT. The result generalizes Theorem 3.39 in 
which TOP o TOP C TOP has been proved. The idea behind the construc- 
tion involved in Theorem 3.39 can be used again: it is a common product 
construction. 

Lemma 5.46. ATT o TOP C ATT. 

Proof. Let A = {AUa, TJy A,aoy Ra, Ea) be a noncircular attributed 
tree transducer and let T = (Q, X2, g, i?T) be a top-down tree trans- 

ducer. We construct a noncircular attributed tree transducer B = 
(AUb^E, Qy (g, ao)y Rsy Eb) such that taovt = tb- 

First we modify T to be the top-down tree transducer T' which can 
transform the right-hand sides of rules in Ra- For this purpose, let mx = 
max{k I 0} and define the two ranked alphabets 

AttA{7^, mx) = {a(7Tz) | a G {AttA)8yn, 1 < * < ma:}U{6(7r) | b G {AttA)inh} 

and 



Q X AttAi'^y mx) = {(g, a){m) \ g G Q, a G {AitA)syn, 1 < * < mx} 

U{(g,6)(7r) I q£Qybe{AttA)inh} 
where every symbol has rank 0. 

Now construct the top-down tree transducer T' = {Q,A U 
AttA{^^^ mx)y f2\jQ X AttAi'^y mx)y g, Rij,) by defining 

R!j^ = RTU{q{a{7ri)) {qya){wi) | g G Q, a(7T2) G mx)} 

U{g(6(7r)) (g, 6)(7 t) | g G Q, b{ir) G A«a(7t, mx)}. 

Note that “a(7T2) G AttA{T^ymxY is shorthand for “u; G AttAi'^^mx) and w 
has the form a(7ri)” and similarly for “6(7 t) G a(^, mx)” . 

Next we construct B = {AtiByEy i?, (g, ao), Eb) as follows: 

• AUb = {AttB)syn U (AttB)inh where {AttB)syn = Q X {AttA)syn and 
{AttB)inh ^ Q X {AttA)inh‘ 
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• Rb is the smallest set satisfying the following conditions. For every G 
for every q GQ, and 

- for every rule a(7r) — ^ in {Ra)^^ the rule (^,a)(7r) — rr\q{0 is in 
{Rb)<7, and 

- for every rule b{7ri) — v ^ in {Ra)^^ the rule {q,b){7ri) is 

{RB)a- 

• Eb is the function Eb • {AitB)inh such that, for every (g,6) G 

{AttB)inh, EB{{qyb)) = vr^q{EA{b)). 

The following intermediate statement can be proved by structural induc- 
tion on 

For every g G Q, ^ G RHS{{AtiA)8yn, {AttA)inh, A k) with 0 < A: < 
mx, and set {ta,j E Ta | a G {AttA)8ynA < j < k} U {U | 6 G 
(^AttA^inh}- 

^ ^aj]a{7Tj) G AttA{7r, mx)][b(ir) U; 6 ( 7 t ) G AUa{tt, mx)]) = 
^ ^A^a,j);{p,a){irj) eQx AttA{Tr,mx)] 

[(P) ^)(^) ^ Pr,p(^fc); (p, b){TT) eQx A«a(7t, mx)]. 

Using this intermediate statement, the fact that the predicates K and L 
hold on Tsi can be proved by the principle of simultaneous induction. 

K: For every s eTjj, a e {AttA)syn, q ^ Q, and E : {AitA)inh ^ Ta, the 
equation 

Pr,g(rA,a(s)[K^) ^ E{b);be = 

TB,(q,a){s)[{p,b){e) ^ 7^,p(£'(6)); p G Q,6 G {AttA)inh] 

holds. 

L: For every k > 0, a e ^ G RHS{a), u G {Ts)^, q ^ Qi and 

E : {AttA)inh Ta, the equation 

TrA'^A,aA^)[K^) ^ E{b);be {AttA)inh]) = 
TB,a,^'i(^)[{p,b){€) ^ VrAE{b))]P eQ,be {AttA)inh] 

holds, where = '^r^g(0 
Theorem 5.47. ATT o TOP = ATT. 

Proof. The statement follows immediately from ATToTOP C ATT (Lemma 
5.46), the fact that TOP contains the identical tree transformations, and 
Lemma 2.14. □ 
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5,6 Bibliographic Notes 

Attributed tree transducers were introduced in [FiilSl] (see also [BarSl, 
Bar83]). The composition result ATT o TOP = ATT (Theorem 5.47) is due 
to this paper. The proofs of Lemmas 5.40 and 5.43 are due to [Kiih95a]. 

We have defined the concepts of a dependency graph, an extended de- 
pendency graph, and an is-graph for attributed tree transducers in a very 
similar way to the corresponding concepts for attribute grammars (see, e.g., 
[Cou84, Eng84]).The circularity test is also similar to the circularity test for 
attribute grammars [ASU86] (see also [KV94b]). The result concerning the 
intrinsic exponential circularity test is due to [JOR75], see also [Jaz81]. The 
strictness of the composition hierarchy of attributed tree transducers is due 
to [Fiil81], see also [Eng81]. 

In [Eng84] attributed tree transducers are used as a formal model for the 
computation of attribute values in an attribute grammar (in program P4). 

In [FB[VV93] attributed tree transducers have been considered as program 
schemes in the sense that some of the output symbols can be interpreted in 
a semantic domain with the set of trees as the carrier set and tree functions 
as operations on the carrier set. There it is proved that the class of primi- 
tive recursive tree functions (cf. [Hup78]) is closed under the application of 
attributed tree transducers viewed as program schemes. 

In [Gie88] attribute coupled grammars are introduced. It is shown that, 
under a certain condition (called syntactic single use requirement), the com- 
position of two attribute coupled grammars is again an attribute coupled 
grammar. 

A pumping lemma for output tree languages of attributed tree transducers 
has been investigated in [KV94a] (see also [Man96]). 




6. Compcirison of Induced Tree Transformation 
Classes 



In the previous two chapters we studied two different formal models, i.e., 
macro tree transducers and attributed tree transducers, which extend top- 
down tree transducers in the sense that they can handle context information. 
In this chapter we aim to compare the power of macro tree transducers and 
attributed tree transducers. We collect all the inclusions which relate the 
classes TOP, YIELD, ATT, MAC, and their subclasses into an inclusion 
diagram. Moreover, we present the composition semigroup which is generated 
by the set {TOP, ATT, MAC}. Hence, the structure of this chapter is as 
follows. 

1. Comparison of MAC and ATT 

2. Inclusion diagram based on TOP, YIELD, ATT, and MAC 

3. Composition semigroup generated by TOP, ATT, and MAC 

4. Bibliographic notes 



6.1 Comparison of MAC and ATT 

Every attributed tree transducer A can be simulated by a macro tree trans- 
ducer M: in M the synthesized attributes of A are considered as states, 
and the inherited attributes of A are simulated by parameters of states. 
Roughly speaking, for every state a and input symbol a G the right- 

hand side of the (a, (r)-rule is obtained by unfolding the dependency graph 
DA{o‘)[isf,...,isf] where isf contains an edge between every inherited at- 
tribute and every synthesized attribute. Thus, isf covers every possible de- 
pendency. The unfolding process is indicated in Fig. 6.1 where in (a) the out- 
put symbols occurring in the right-hand sides of the rules of A are integrated 
into the dependency graph. The unfolding terminates whenever an outside 
attribute occurrence is encountered for the second time (as, e.g., a(7rl)). 

Lemma 6.1. ATT C MAC. 

Proof. Let A = {Att,E,A,ao,R,E) be an attributed tree transducer with 
Ati = Atisyn U Attinh- We construct a macro tree transducer M such that 
ta = Let (6i, . . . , br) be a linear order of the inherited attributes of A. 
Define M = (Q. E. A. an, iZ', E'^ as follows: 
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• Q = I a € Att,yn} 

• ii' is the smallest set which contains the following rules: if a(7r) — ^ ^ is in 
Rff for some a G Aitsyn and a € with fc > 0, then the rule 

a{a{xi , . . . , xjk), yi , . . . , yr) 

is in iJ' where is obtained from ^ by performing the following substi- 
tutions: 

- every a'(7rz) with a' E Attsyn and 1 < i < fc is replaced by 

a'{xi,t{i, 1, {(a',i)}}, . {(a', t)})) 

- every bj (tt) is replaced by yj . 

For every i/ E {1, . . ,fc}, « E {1, . . .,r}, and P C Attsyn x {1, . . .,fc}, we 
define recursively the tree t{i/, /c, P) as follows. Let s = r/is(6,c(7ri/), a), 

- If, for some (c, /) E P, the subtree c(7t/) occurs in s, then define t{i/, /c, P) 
= a for some a E 

- Otherwise, t(z/, «, P) is obtained from s by performing the following sub- 
stitutions: 
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• every subtree c{irl) for (c, /) G (Attsyn x {Ij • • • , ^}) — P is replaced by 
where P' = P U {(c, /)} 

# every subtree bj{Tr) for bj G AtUnh is replaced by yj, 

• P' = (P(6i),...,P(6,)). 

This completes the construction of M. We add two remarks concerning the 
termination of the construction of the right-hand sides and about the role of 
a. During the construction of the right-hand side 0(^), several terms /c, P) 

have to be constructed. In fact, the construction proceeds in a recursive way, 
and in every recursion step one pair (c, i) is added to the set P. Recall that 
(c, i) represents the outside attribute occurrence c{iri). However, since there 
is only a finite number of such outside attribute occurrences, the recursion 
has to stop eventually. 

What about the role of a E A? The term t{u , «, P) is defined to be a, if, 
for some (c, i) G P, the outside attribute occurrence c{Tri) occurs in the right- 
hand side rhs{bK{in/)^ a). The fact that (c, i) G P means that the construction 
process has visited this outside attribute occurrence c(7ri) before. However, 
if during the computation of A on a particular input tree s, the attribute 
occurrence c(7ri) is encountered twice, then A is circular and this contradicts 
the assumption that A is noncircular. Hence, it is reasonable to stop the 
construction of the terms at this point by placing an arbitrary 

symbol into 0(^): it will not be used in the computation of M anyway. 

We can prove the correctness of the construction by proving the following 
statements by simultaneous induction according to Principle 5.28. 

K: For every a G Attsyn and s ETs, TA,a{s)[bj{e) ^ yj]l < j < r] = 
TM,a(s)- 

L : For every k > 0, a E ^ E RHS{a), and a; = (si, . . . , sjb) G (Tj;)*, 
TA,aA^)[bj{e) ^ yj;l < j < r] = where 0{l^) is defined as in 

the definition of P'. □ 

Now the question arises whether this inclusion also holds in the other 
direction. It does not. In fact, macro tree transducers have more transfor- 
mational power than attributed tree transducers. This is due to the different 
capabilities of increasing the height of input trees. For attributed tree trans- 
ducers, the height of an output tree is linearly bounded in the size of the input 
tree, whereas in Chap. 4 we presented an example of a macro tree transducer 
which produces output trees of which the height is exponential in the height 
of the corresponding input trees. 

Lemma 6.2. ATT C MAC. 
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Proof. In Lemma 4.23, it was shown that there is a macro tree trans- 
ducer Mexp with a monadic input alphabet such that, for every input tree s, 
height{TM^^j,{s)) = Clearly, since the input alphabet is monadic, 

size{s) is equal to height{s). Now it follows from the height-lemma of at- 
tributed tree transducers (Lemma 5.40) that there is no attributed tree trans- 
ducer A such that ta = . D 

We can even define a small subclass of MAC which contains tree trans- 
formations that cannot be induced by any attributed tree transducer. This 
subclass is induced by so-called basic macro tree transducers where, in the 
right-hand sides of rules, no state may occur nested in the parameter position 
of another state. 

Definition 6.3. Let M = (Q, i7. A, go, Ry E) be a macro tree transducer. M 
is basic if, for the right-hand side ^ of every rule in iZ, the following holds: if 
• • *,^r) is a subtree of ^ with q E Q, then, for every 1 < j < r, the 
tree does not contain a state. □ 

Let bas-MAC denote the class of tree transformations which are induced 
by basic macro tree transducers. Next we define a tree transformation Tpath 
for which we will prove that Tpath E bas-MAC — ATT, 

Definition 6.4. Let S = and A = Define 

the tree transformation Tpath : Tj; — ► such that 

Tpath - {{y''{a),apprev{sn^i)) | n > 0} 

where Sn is the fully balanced tree of height n (as defined in Example 3.29) 
and, apprev{sn) is the tree which is obtained from Sn by substituting, for 
every w G occ{sn) such that label{snyw) = a, that occurrence of a by the 
unary tree Figure 6.2 illustrates the function Tpath- ^ 



Theorem 6.5. bas-MAC — ATT ^ 0. 

Proof. We show that Tpath E. bas-MAC. Define the macro tree transducer 
M = (Q, i7. A, g, R, E) by 

• E and ^ as in Def. 6.4, 

• R contains the rules: 

1) 9(7(«i).S/i) -*^0’(9(®i.l(j/i)).9(a:i,2(yi))), 

2) g(a,yi) -^yi- 

• E = (#). 

It is easy to see that tm = Tpath- Moreover, it is also easy to see that 
T/Vf (t”(<^)) contains 2*^ different subtrees of the form Due to the 
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subtree-lemma of attributed tree transducers (Lemma 5.43), there can be no 
attributed tree transducer A such that ta = tm- O 

How can we restrict macro tree transducers so that they have the same 
transformational power as attributed tree transducers? We do not know the 
answer to this question. However we can characterize a large subclass of 
ATT in terms of restricted macro tree transducers. The restriction amounts 
to considering well-presented macro tree transducers, and the large subclass 
is the class of absolutely noncircular attributed tree transducers. 

Before defining the property well-presented, let us work through the idea 
behind this restriction. Consider again the basic macro tree transducer M of 
Theorem 6.5 which computes the tree transformation Tpath (Def. 6.4). Let us 
now try to apply the inverse of the construction in which, for some attributed 
tree transducer A, a macro tree transducer M has been constructed such that 
Ta = Tftf (Lemma 6.1). Clearly, it is not possible that the inverse construction 
will work, because otherwise MAC C ATT and this contradicts Lemma 6.2. 
However we will investigate what makes the inverse construction fail, and 
this will lead to the definition of well-presented. 

If we apply the inverse construction of Lemma 6.1 to M, then we obtain 
an attributed tree transducer A which has onp svnthftsizpH attrihnf.p n and 
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one inherited attribute y\ . Since 7 is an input symbol, there must be a set 
ily of rules in A, and in there must be a rule q(ir) — ► ^ . By applying the 
inverse unfolding process to the right-hand side of the rule 5(7(2?! ), 2/1) 
<r{q{xi^ 1(2/1)), g(xi, 2(2/1))), it is clear that ^ = a{q{Trl), 5(7 t1 )). However, now 
there are two different terms, viz. l(yi) and 2(2/1), which are to be considered 
as the result of the unfolding process. Inverting this unfolding means that 
there are two rules 

yi(7Tl) -*• l(yi(7r)) 
yi(irl) 2(yi(7r)) 
in Ry which, clearly, is not allowed. 

Thus, in order to use the inverse construction, the macro tree transducer 
should have the property that, considering the set of all (7-rules (for some 
input symbol a), whenever there are two states q and p which are called 
on the same input subtree, then the parameter positions of q and p should 
contain the same terms. This is the notion of super well-presented. 

In fact, it is possible to relax this condition a little by associating with 
every state q a subset 7(5) of its parameter positions and requiring the equal- 
ity of terms only for the parameter positions in the intersection j{p) fl 7(5). 
This is the notion of well-presented. 

Definition 6.6. Let M = (Q, i7, 4, 50, R^ E) be a macro tree transducer. 

1) M is well-presented if there is a function jm • Q ^(^) such that for 

every q,q' £ Q, k > 0, and <r 6 the following properties hold: 

(3*) TM (5) ^ 'P 0 ^rank{q)—l) 1 

(b) if q{(r{xiy . . . , xjb), 2/1, • , 2 /n) C is a rule in R and yj occurs in 

then yj G 7m (5), and 

(c) if 5((7(a?i,...,XA:),2/i,...,yn) ^ ^ and q'{(r{xi, . . . ,Xk),yi, . , . ,yn') 

are rules for a, and p(a:i,^i, . . .,^m) and p'(ari,^J, . . .,^^/) are 
subtrees of ^ and respectively, then, for every yj G 7 m(p)H 7 m (pOj 
the equality holds. 

2) M is super well-presented if M is well-presented under the function 7 m 

such that for every q EQ, the value jm{q) = yrank(q)-i- 

Note that, if M is super well-presented, then condition (b) is meaningless. 
Moreover, condition (c) requires that ^i = ^i, . . where we assume 
without loss of generality that m < w! . Theorem 6.5 uses a macro tree 
transducer which is not well-presented because it violates condition (c). 

Let the class of tree transformations which are induced by (super) well- 
presented macro tree transducers be denoted by wp-MAC (and swp-MAC^ 
respectively). It is clear that every super well-presented macro tree transducer 
is also well-presented. 

Note 6 . 7. swp-MAC C wp-MAC. 
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Next we show a well-presented macro tree transducer which is not super 
well-presented. 

Example 6.8. Consider a well-presented macro tree transducer M with states 
p and g, both of rank 3. The function y which associates parameters to states, 
is defined by y{p) = {t/i} and y{q) = {2/2}- Moreover, M has the rule 

p{<r{xi , X 2 ), yi , j/2) -♦ P{xi. , q{xi, a,/3),j) 

where a, and 7 are nullary output symbols. Then M is well-presented, 
because y{p) fl y{q) = 0. However, M is not super well-presented because 
p(a?i, a, /?), 7) and g(ari, a, /3) are two subtrees of the right-hand side of 
some (7-rule of M which both refer to xi, and, e.g., g(a:i,a,j0) ^ a. □ 

It is also clear that every superlinear macro tree transducer (Def. 4.27) is 
super well-presented. 

Note 6.9. sI-MAC C swp-MAC. 

By transforming a well-presented macro tree transducer M into an at- 
tributed tree transducer A^ we can show that A is restricted; it is absolutely 
noncircular. Also, by transforming a super well-presented macro tree trans- 
ducer M into an attributed tree transducer A, we can show that A is a 
restricted absolutely noncircular attributed tree transducer; it is even one- 
visit. Finally, by transforming a basic well-presented macro tree transducer 
M (i.e., M is basic and well-presented) we obtain a nonnested attributed 
tree transducer A, where nonnested is a restriction of one- visit. Before we 
show the transformations, let us define the three restrictions on attributed 
tree transducers. The definitions of is-graph and composition of graphs were 
given in Defs. 5.15 and 5.16. 

Definition 6.10. Let A = (Atf , i7. A, ao, i?, E) be an attributed tree trans- 
ducer. A is absolutely noncircular if there is an is-graph IS = (Att^E) over 
Attsyn and Attinh such that, for every a G with A: > 0, the following 
properties hold. 

1 ) Da (o') [75, . . . , IS] is acyclic where the rank rank{(r) determines the num- 
ber of occurrences of IS and 

2) if (c, c') is an edge of zs(Da((t)[/ 5, . . . , 75]), then (c, c') is also an edge 

of IS. □ 

The class of tree transformations which are induced by absolutely noncir- 
cular attributed tree transducers is denoted by anc-ATT. 

Note 6.11. anc-ATT C ATT. 

Example 6.12. There are noncircular attributed tree transducers which are 
not absolutely noncircular. Figure 6.3 shows three dependency graphs of such 
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an attributed tree transducer A. Let us assume that the inherited attributes 
are called 6i and 62 and the synthesized attributes are called ai and 02- In 
Fig. 6 . 3 , the attribute occurrences at every node are arranged from left to 
right as follows: 6 i(7t), 62(7t), ai(7r), and a2(7r). 

Clearly, in order to fulfill condition 2 of Def. 6 . 10 , the is-graph IS should 
contain edges from 61 to 02 (due to DA(ct)) and from 62 to ai (due to Ba(j 3 )). 
But then condition 1 is violated because Da(7)[IS\ contains a cycle. 

However, due to Lemma 5 . 17 , A is noncircular because is-setA contains 
two is-graphs (one with one edge from 61 to 02 and the other with one edge 
from 62 to ai) and £>^(7) [is] does not contain a cycle for any choice of is 
from is-setA • 

It is obvious that, in an absolutely noncircular attributed tree transducer, 
the is-graphs of every possible input tree are merged together resulting in an 
is-graph, say, is, and it is required that the composition of Da(<^) (for every 
input symbol a) and is does not contain a cycle. Then it is entirely possible 
(as in Example 6 . 12 ) that there is no input tree which has is as an is-graph. 




Fig. 6.3. Dependency graphs of a not absolutely noncircular attributed tree trans- 
ducer 

The one- visit restriction is defined by means of the notion of brother 
graphs. 
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Definition 6 . 13 . Let A = {Att, U, Ay ao, iJ, E) be an attributed tree trans- 
ducer and let a G for some A? > 0. 

The brother graph of a is the directed graph BGa = {N,E)y where 

• N = Xk is the set of nodes, and 

• £* is the set of edges; E is the smallest subset of Xk x Xk such that for 
every 1 < iyj < Ar, if there is a 6 G AtUnh and an a G Atigyn such that 
a(7ri) occurs in rhs(6(7rj), (t), then there is an arc (xi,Xj) G E. 

Definition 6 . 14 . Let A = {Atty Ey Ay ao, iZ, E) be an attributed tree trans- 
ducer. A is one-visit if, for every a £ Ey the brother graph BGa does not 
contain a cycle. □ 

The class of tree transformations which are induced by one- visit attributed 
tree transducers is denoted by \v-ATT. 

If an attributed tree transducer A is noncircular, then a graph 
Da(o')[/ 5 , . . . , lS\y where a G E^^\ Ar > 0 , and IS is an is-graph, can contain 
a cycle only if the brother graph of a is cyclic. Hence the following note is 
justified. 

Note 6.15. Iv-ATT C anc-ATT. 

Example 6.16. There are absolutely noncircular attributed tree transducers 
which are not one- visit. In Fig. 6.4 we show the two dependency graphs of 
an attributed tree transducer A, again with inherited attributes h\ and 62 
and synthesized attributes a\ and 02, and with the same arrangement as in 
Fig. 6 . 3 . It is easy to check that the brother graph Ba contains a cycle due 
to the dependencies ((ai, ttI), (62, 7t 2)) and ((02, 7 t2), (62, ttI)). However, A is 
absolutely noncircular: consider the is-graph IS which contains edges from 
61 to 02 and from 62 to a\. Both conditions of Def. 6.10 are fulfilled for this 
IS. □ 

Finally, we introduce the notion of a nonnested attributed tree transducer. 
Briefly, an attributed tree transducer is nonnested if there is no inherited 
attribute which depends on a synthesized attribute. 

Definition 6 . 17 . Let A = {Atty Ey Ay ao, R, E) be an attributed tree trans- 
ducer. A is nonnested if, for every A:>l,l<i<Ar, 6 g AtUnh, and <t G E^^^y 
the right-hand side rhs( 6 ( 7 ri), (t) does not contain any attribute occurrence 
a(7rj) with a G Attsyn and 1 < j < Ar. □ 

The class of tree transformations which is induced by nonnested at- 
tributed tree transducers is denoted by nn-ATT. It is clear that every 
nonnested attributed tree transducer is also one- visit because all its brother 
graphs are empty. 



Note 6.18. nn-ATT C Iv-ATT. 
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Fig. 6.4. Two dependency graphs of a not one- visit attributed tree transducer 



It is also clear that every top-down tree transducer is a nonnested at- 
tributed tree transducer. 

Note 6.19. TOP C nn-ATT. 

Example 6.20. Clearly, there are also attributed tree transducers which are 
one- visit, but not nonnested. In Fig. 6.5 there are the two dependency graphs 
of a one- visit attributed tree transducer which is not nonnested. The latter 
(non)property is due to the rule for b{ir2): in its right-hand side it contains 
a(7rl) where a is a synthesized attribute. □ 




Fig. 6.5. Two dependency graphs of an attributed tree transducer which is not 
nonnested 
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Now we can prove that every well-presented macro tree transducer can 
be transformed into an absolutely noncircular attributed tree transducer. 
The same transformation is possible for super well-presented and basic well- 
presented macro tree transducers; this leads to one- visit attributed tree trans- 
ducers and nonnested attributed tree transducers, respectively. 

Lemma 6.21. 1) wp-MAC C anc-ATT 

2) swjhMAC C Iv-ATT 

3) bas-wp-MAC C nn-ATT. 

Proof. We show the construction for the first statement of this lemma, and 
then we show that this construction leads to the desired restrictions of the 
second and third statements of this lemma. 

Let M = (Q, i7, qo^ R, E) be a well-presented macro tree transducer. 
Then there is a function jm • Q which fulfills the conditions of 

Def. 6.6. Let a be an arbitrary element in Denote max{n | ^ 0} 

and max{n I ^ 0} by max(M,X) and max(M, Y), respectively. We 

construct an attributed tree transducer A = {Att^ A^qo^ E^) such that 
ta = tm- Let A be defined as follows: 

^ An An gyfi LJ where A.n gyfi — and An^fifi — \.yj I ^ J ^ 

max{M^Y)} 

• iZ': First we construct the set U< 7 g 17 second we add appropriate 

dummy rules to the sets of this family if necessary. 

For every a £ E^ the set is the smallest set for which the following 
condition holds: if q{a{xi , . . . , xjb), 2/i, . . . , t/n) ^ is in /Z, then 

{g(7T) top(0} U DECOMPOSEiO C iZ^. 

The function top : RHS{QyAjmax{M,X),max{M,Y)) RHS{Attsyn, 
Attinhi Aj max{Mj X)) is defined by structural induction on its argument. 

(i) For every p G with / > 0, 1 < i < max{M^X)^ and G 

RHS{Q, A, max{M,X), max{M,Y)), define , . . . ,^/)) = 

p{'Kl). 

(ii) For every 6 G A^^^ with / > 0 and ^i, . . . G RHS{Q^ A, max{M, X), 
max{M,Y))j define 

iop{6{Ci,...,^i)) = 5(top(^i),...,top(^/)). 

(iii) For every yj G Ymax(M,Y), define top{yj) = yj{Tr). 

The function DECOMPOSE : RHS{QjA,max{MjX)^7nax{MjY)) 
V{{1 r 1 / r is a rule of an attributed tree transducer}) is also 

defined by structural induction on its argument. 

(i) For every p G with / > 0, define 

DECOMPOSE{p{xi,^i , . . . , 6)) = 

{yj{ni) top{^j) I yj G Tm(p)} U Ui<i</ DECOMPOSE{^j). 
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(ii) For every 6 € A^''> with / > 0, define DECOMPOSE{6{^i , . . . = 

\J,^j^iDECOMPOSE{^j). 

(iii) For every yj G V'mo®(M,y). define DECOMPOSE{yj) = 0. 

Note that, due to the fact that M is well-presented, it is not possible that 
there are two different rules in DECOM POSE{p{xi,^iy . . .,^/)) with the 
same left-hand side yj{iri). Also note that DECOMPOSE terminates, 
because it is applied to the right-hand sides of the rules of M, i.e., finite 
trees. This finishes the first step in the construction of /J'. 

Now consider an input symbol a G If, for some i with I <i < k and 
some j with I < j < max{M^ y), there is no rule in with left-hand side 
yj{ni), then add the dummy rule 

yj(iri) a 
to 

• Let £■ = (ti, . . . ,fr)- Note that r is the rank of the initial state qo of M 
and that r < max{M,Y). Then we define the function : AtUnh — ► Ta 
as follows: for every I < j < r, let E'{yj) = tj and for every r + 1 < j < 
max(M,Y), let E^{yj) = a. 

The correctness of the construction follows from the following statements 
which can be proved by simultaneous induction. 

K: For every q e Q and s £ Ts, TM,q{s) = TA,q{s)[bj{e) yj]l < j < 
max{M,Y)]. 

L: For every q E k > 0, a £ ^ £ sub{rhs{q,(r)), and lj = 

(«!,•••, Sib) € ^ w;! < j < 

max{MyY)]. 

Now we have to show that A is absolutely noncircular. Since M is well- 
presented, there is a function jm Q ^ with the three properties given 
in Def. 6.6. We construct the is-graph IS with Attsyn U Attinh as a set of 
nodes and, if yj £ 7m (^) j there is an edge from yj to q. Now assume that, 
for some a £ the graph G defined by G = Da{o’)[IS, . . . , IS] is cyclic. 
Then there is a sequence 

9l (5Til ) , yj, (7Ti2) , . . . 9r- 1 (irir- 1 ) , %V - 1 (’Tir ) , 9r (5Tir ) 

of attribute occurrences such that 1 < ii, . . ., iV-i, V < fc, for every pair of 
consecutive attribute occurrences there is a corresponding edge in G, and 
qi{mi) = gr(^ir) (making G cyclic). 

Since yjiiT^h) depends on gi(7rii) and since gi(7T2i) = grC^rir) there must 
be a (7-rule A — ► p in M of which the right-hand side contains the sub- 
tree • • -,6) for some Since (%v_i(7rir), gr(^ir)) is an 

edge of G, there is an edge (t/jV-u^r) in IS, and hence, by definition of 
IS, yj^^i G 7M(gr)- Since 2/ir-i(^*r) depends on gr-i(^^r-i) and since the 
construction will deduce the right-hand side of the rule for from 




6.1 Compaxison of MAC and ATT 185 



the parameter term parameter term has to contain the subtree 

some Note that, if there is another oc- 
currence of a subtree qri^iri^ii • • • j^/) for some in some right-hand 

side of a (7-rule, then Cjr-i — because M is well-presented. 

Now we can repeat the same argument for 9r-i(7rv-i), as for gr(^v)- If 
we repeat this argument r— 1 times, we draw the (global) observation that the 
jr-i“th parameter term of the subtree . .,^/) contains the subtree 

• • • >^///) for some^i', . . . Since gi(7rii) = grC^rir), the two terms 

• • • ) and qr{xi ^ , . . . , are syntactically equal. Moreover, by 

qi = qr, we have 7 m(9i) = TM(^r)- Hence G 7 m(^i) which implies that 
^ir-i = other hand, is a proper subtree of Hence, 

there is a contradiction with the assumption that G contains a cycle. 

Also we have to prove that, if (c, c') is an edge of is{DA{(r)[IS , . . . , 75]), 
then (c, c') is also an edge of IS. Let (yj,q) G AtUnh x Attsyn be an el- 
ement of 2 s(Da(( 7)[75, . . . , 75]). Hence, by Def. 5.15, there is a path in 
D^((t)[ 75, . . . , 75] from j/j(7r) to q{ir). Considering the construction of A such 
a path can only exist if the right-hand side of the (g, (r)-rule contains yj . Then, 
by Def. 6.6, yj G jM{q) and thus, by the construction of 75, {yj , g) E 75 also. 

Now assume that M is super well-presented. Then it is not possible 
that there is a right-hand side of a rule of M with a subtree of the form 
p{xij . . . g(x,-, ...)...). Hence, by the definition of DECOMPOSE^ A is one- 
visit. 

Finally assume that M is basic and well-presented. Then it is immediately 
clear from the construction that A is nonnested. □ 

Now we show that the inclusions of Lemma 6.21 also hold in the other 
direction. In fact, the construction of the following lemma is quite similar to 
the construction of Lemma 6.1 {ATT C MAC). But here the unfolding of 
the right-hand side is stopped by the function Jm ' Q 'P(^) rather than 
by a set P. 

Lemma 6.22. 1) anc-ATT C wp-MAC 

2) Iv-ATT C swjhMAC 

3) nn-ATT C bas-wp-M AC . 

Proof. We show the construction for the first statement of this lemma, and 

then we show that this construction also implies the second and third state- 
ments of this lemma. 

Let A = {Att^ i7, Zi, ao, 7?, E) be an absolutely noncircular attributed tree 
transducer such that AtUnh = {biy...,br} for some r > 0. Let a G be 
an arbitrary, but fixed, output symbol. 

We construct the macro tree transducer M = (Q, 77, A, ao, R\ E*) as fol- 
lows. The set Q is equal to Attsyn and every a E Q has rank r -h 1. Next we 
consider a (7 G for some fc > 0 and we construct the ( 7 -rules from the 
elements of . 
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By Def. 6 . 10 , there is an is-graph IS = {Att^ E) with certain properties. IS 
induces the function 7/5 : Q V{Y) by defining 7/5(0) = {yj | {bj^a) E E] 
for every a G Q. 

Then, for every a G Q, we construct the rule 

a{(T{xi,...,Xk),yi,...,yr)^t 
where t = COMPOSE{rhs{a{Tr),<T)). The function 

COMPOSE : RHS{Attsyn,Attinh, A, k) RHS{Q, A, k, r) 

is defined inductively as follows. Let f G RH S{Atisyn, Attinhj k). 

(i) If ^ = c(7Tz) for some c G Attayn and I < i < k, then COMPOSE{^) = 

. . . ,^r) where, for every 1 < i < r, 

. _ / a if Vi t 7fs(c) 

- lCOMFOSF(rhs(bj(7ri),a)) if yj ejis{c). 

(ii) If ^ = bj{Tr) for some bj G AiUnh, then COM POSE{^) = t/j. 

(hi) If I = 5 ( 6 , ...,^i)foiSeA and 6, • • • ,6 ^ RHS{Attayn,AUinh, k), 
then COMPOSEiO = 6{COMPOSE{^i ), . . . , COMPOSE{^i)). 

Note that the definition of tj is acyclic, because by Def. 6.10 1 ) the graph 
Da{o‘)[IS, . . . , IS] has no cycle. 

Moreover, let E' = (£*(61), . . . , E{br)). 

The correctness of the construction follows from the following statements 
which can be proved by simultaneous induction. 

K: For every a G Attayn and s G Tj;, TA,a{s)[bj{e) yj]l < j < r] = 

TM,a{s), 

L : For every fc > 0 , (t G ^ G RHS{<r), and u = {si, ... ,Sk) £ , 

T'A,a,^{(*^)[bj{e) yj\ I < j <r] = tm,compose(o(^) 

It is easy to see why M is well-presented. For this purpose consider some 
(a, cr)-rule of M. By Def. 6.10 2 ), is{DA{(r)[ISy . . . , IS]) C IS, i.e., we can say 
that 75 is closed under embedding in dependency graphs, and by definition 
of JiSi it follows that the elements of 7/5(0) at most appear in rhs{a, a). This 
means that Condition 1 (b) of Def. 6.6 is fulfilled. Condition 1 (c) is derived 
from the fact that, for every inherited attribute bj at some i-th descendant 
of a (7-node, there is a unique rule in R. Since COMPOSE is a function, the 
term COMPOSE{rhs{bj{wi),(r)) is also unique. 

Now assume that A is one- visit. Define the function jfuu : Q — > P{y) such 
that 7/u//(o) = {2/1, . . . , 2/r} for every a G Q. Then we have to show Condition 
1 (c) of Def. 6.6 for jfuU- For this purpose consider two (not necessarily dif- 
ferent) rules a{<r{xi ,. . . , Xk), yi, . . . , yr) ^ ^ and a\<r{xi, ..., Xk), yi, • . . , yr) 
— > for ( 7 , and let c{x { , 6 > • • • ) 6) and c'(x,* , 6 > • • • > ^r) subterms of ^ and 
6, respectively. Moreover consider a parameter position j. If c = c', then 
clearly 6 = Now let c ^ c'. Then, if 6 ^ then either {yj G 7/5(0) and 
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Vj ^ lis{c*)) or {yj ^ lfis{c) and yj G 7/5(c0)- But by the definition of 7 /^//, 
every yk is in 7/ti//(^)- Hence, The definition of COMPOSE still ter- 

minates, because A is one-visit and hence, no recursive call of COMPOSE 
on the same subtree can arise. Thus, M is super well-presented. 

If .A is nonnested, then, for every inherited attribute 6, input symbol (T, and 
position iy the right-hand side rhs{b{wi)y(r) does not contain a synthesized 
attribute occurrence. Thus, COMPOSE does not generate a call to a state 
inside the parameter position of another state. Hence, M is basic. Of course, 
M also remains well-presented. □ 

From Lemmas 6.21 and 6.22 we obtain the following connection between 
the classes of tree transformations which are computed by macro tree trans- 
ducers and attributed tree transducers. 

Theorem 6.23. 1) wp-MAC — anc-ATT 

2) swp-MAC = Iv-ATT 

3) bas-wp-MAC = nn-ATT, 

By means of this theorem, we can relate the class of YIELD-functions to 
the class ATT. 



Corollary 6.24. YIELD C ATT. 



Proof. 

C 


YIELD 

sl-MAC 


(Lemma 4.32) 


C 


swp-MAC 


(Note 6.9) 


= 


Iv-ATT 


(Theorem 6.23) 


c 


ATT 


(Def. 6.14) 



Another important consequence relates the classes of tree transforma- 
tions computed by (arbitrary) macro tree transducers and by attributed tree 
transducers. 

Theorem 6.25. 1) MAC = HOM o \v-ATT 

2) MAC = HOM o ATT 

3) MAC = TOPoATT. 

Proof. 

MAC 

= HOM o sTMAC 
C HOM o swp-MAC 
= HOM o Iv-ATT 
C HOM o ATT 
C TOP o ATT 
C TOP o MAC 
C MAC 



(Corollary 4.39) 

(Note 6.9) 

(Theorem 6.23) 

(Note 6.15 and Def. 6.14) 
(Corollary 3.32) 

(Lemma 6.1) 

(Corollary 4.38). 
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Theorem 6.26. For every n > 1, MAC'^ = HOM o ATT^. 

Proof. The proof is a simple induction on n. For n = 1, the statement holds 
by Theorem 6.25. 

= MAC” o MAC 
= HOM o ATT” o MAC 
= HOM o ATT^ o TOP o YIELD 
= HOM o ATT^ o YIELD 
C HOM o ATT”+^ 

C HOM o MAC”+i 
C MAC”+i 



6.2 Inclusion Diagram Based on 
and MAC 

In this section we assemble in one diagram (Fig. 6.6) most of the inclusion 
and incomparability results which have been proved for the classes TOP, 
YIELD, ATT, and MAC (and their subclasses). It is the aim of this section 
to prove that this diagram is an inclusion diagram in the sense of Section 2.2. 
The proof is performed by applying the five steps of Algorithm 2.1: 

Step 1: The diagram in Fig. 6.6 is the conjectured diagram. 

Step 2: The set INC L consists of the following 12 formal inclusions (the 
reader should disregard the references for the time being): 



1. 


HOM 


c 


TOP 


(Corollary 3.32) 


2. 


l-TOP 


c 


TOP 


(Corollary 3.32) 


3. 


sl-TOP 


c 


l-TOP 


(Corollary 3.32) 


4. 


YIELD 


c 


sl-MAC 


(Lemma 4.32) 


5. 


sl-TOP 


c 


sl-MAC 


(Note 4.28) 


6. 


TOP 


c 


nn-ATT 


(Note 6.19) 


7. 


sl-MAC 


c 


swp-MAC 


(Note 6.9) 


8. 


bas-wp-M AC 


c 


bas-MAC 


(Def. 6.6) 


9. 


nn-ATT 


c 


Iv-ATT 


(Note 6.18) 


10. 


Iv-ATT 


c 


ATT 


(Notes 6.15 and 


11. 


ATT 


c 


MAC 


(Lemma 6.1) 


12. 


bas-MAC 


c 


MAC 


(Def. 6.3). 



and the set IN EQ consists of 38 formal inequalities the details of which are 
not shown here. 

Step 3: Every formal inclusion is true. We have already cited in the list 
in Step 2 the appropriate corollaries, lemmas etc. 



(by induction hypothesis) 
(Theorem 4.37) 

(Theorem 5.47) 
(Corollary 6.24) 

(Lemma 6.1) 

(Corollary 4.38). 



TOP, YIELD, ATT, 
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Fig. 6.6. Inclusion diagram for TOP, YIELD, ATT, and MAC 



Step 4: The set MINEQ consists of the following 9 formal inequalities: 

1. bas-MAC - ATT ^ 0 

2. ROM - sl-MAC ^ 0 

3. l-TOP - sl-MAC ^ 0 

4. sl-TOP - ROM 0 

5. sl-TOP - YIELD # 0 

6. YIELD - bas-MAC ^ 0 

7. ATT - Iv-ATT # 0 

8. nn-ATT - TOP f 0 

9. ROM - l-TOP ^ 0 

Step 5: In the following subsections we will verify these nine conjectured 
inequalities so that we obtain the desired theorem: 

Theorem 6.27. The diagram shown in Fig. 6.6 is an inclusion diagram in 
the sense of Section 2.2. 



6.2.1 The inequality bas-M AC — ATT ^ 0 
This inequality has been proved already in Theorem 6.5. 
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6.2.2 The inequality HOM — sl-MAC ^ 0 

In Example 3.29 we defined a tree transformation Tdoubie which translates the 
monadic input tree 7” (a) into the fully balanced output tree Sn+i- Actually, 
there is also a superlinear macro tree transducer M which can compute this 
tree transformation. M has the following rules 

q{yixi),yi) -* q{xi,(T{yi,yi)) 

q(o‘,yi)^yi- 

Thus, M uses its parameter position to accumulate the fully balanced tree 
Sn+i- In particular, the topmost 7 of the input tree generates the lowest (t’s 
of the output tree, and, vice versa, the lowest 7 of the input tree generates 
the topmost <r of the output tree. Thus, we can say that, M reverses the 
order of the input symbols. Now it is easy to modify Tdouhie in such a way 
that the order of the input symbols is preserved in the output tree. Then, 
this modified tree transformation can no longer be computed by a superlinear 
macro tree transducer. 

Definition 6.28. Let E = {7i^^72^\ and A = \ (t^\ • Let 

TdoubU^ordered be the tree transformation which is computed by 

the homomorphism tree transducer H = ({g}, i7, Zi, g, ii) where R contains 
the rules 

1) g(7i(a?i)) -^o'i(g(a?i),g(xi)) 

2) g(72(a?i)) <^2{q{xi),q{xi)) 

3) q{a) —►a. □ 

Theorem 6.29. HOM — sl-MAC ^ 0. 

Proof. By Def. 6.28, Tdoubie^ ordered 6 HOM. We will show that 
Tdoubie-- ordered ^ sl-MAC. We prove by contradiction. We assume that there 
is a superlinear macro tree transducer M = (Q,E,A,qojR,E) such that 
Tjv/ — Tdouble—ordered • AsSUme that E ~ (fl> • • • 

We proceed by making a sequence of observations. 

Observation 1: The (go)7i)-rule contains a?i in its right-hand side. The 
same holds for the (go,72)-rule. 

We prove by contradiction. We assume that, for ^ = Ths{qo,yi), ^ G 
Then, for every input tree s G Tj;, we have 

qoiliis),yi,...,yr) =>M C 

and tm{7i{s)) = ^[t/i ^ ti, . . .,2/r ^ U]- This means that tm{7i{s)) does 
not depend on s, which contradicts tm = Tdoubie- ordered- 
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Observation 2 : The (go, 7i)-rule in R has the form 

qo{li{xi),yi, . . . ,yr) -* p{xi,ui, . . . ,uk), 

for some p G and «i, € T^(Yr). The same holds for the (go, 72)- 

rule. 

This observation can also be proved by contradiction. Assume that the 
root of the right-hand side of the (go,Ti)-rule is an output symbol from A. 
Then, the rule should be of the form 



go(7i(®i), yi, • • • ,!/r) 

where ^1,^2 ^ RHS{Q,A, l,r), such that or ^2 contains x\. The root of 
the right-hand side should be ai because, for every tree s ^Ts, rAf( 7 i(s)) 
should have the form (Ti{u, u), for a suitable tree u G T^. Moreover, both 
and ^2 cannot contain x\, because M is linear. Hence either or ^2 contains 

X\, 

Assume ^2 contains x\. Then, of course, ^1 G Let now s £ Ts 

be an arbitrary tree and consider the input tree 71 (s) to M and compute 
tm{ 7 i{s))- Then, we have 



go( 7 i(«).yi,---,yr) =>-m ^s]) 

=^Af ‘^1(6,6) 



such that ^2 € also and that 

r»f(7l(«)) = 0’l(6bl *-<r],6[yi *-<r])- 

However, this means that, for every input tree s ETs^ the first subtree of the 
output tree rM(7i(«)) is the same tree, i.e., it is ^i[yi ^ ti, . . . , 2/r fr]- This 
contradicts tm = Tdoubie-orderedy which arises from our assumption on the 
shape of rhs(go, 7i)- The case when contains x\ also yields a contradiction, 
by symmetry. 

With this we have proved that rhs(go,7i) can contain x\ only if its root 
is a state. Then the condition u\,, , .,ujc G T^{Yr) follows from the fact that 
M is linear. 

Since the roles of 71 and 72 are exchangeable, we have also proved the 
observation for 72. With this we have proved Observation 2 . □ 

Observation 3 : The (go, 7i)-rule in R has the form 

go( 7 i(a?i), qo{xi Ur), 

for some u\,. , .,Ur G T^(Vr). The same holds for the (go, 72)-rule. 

We prove by contradiction again. We assume that the (go, 7i)-rule has the 
form 

gofTi L 1/1 Vr) — ^ v(x^ , ui tifcL 
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such that p / go and ui, . . . , t/jb ET^{Yr). Since xi occurs in r/is(go, 7i) and 
M is superlinear, x\ cannot occur in r/is(p, 71). Hence the (p, 7i)-rule should 
be of the form 

where ^eTji{Yk). 

Consider now an input tree to M of the form 71(71(5)), where s ETjj is 
arbitrary and compute tm ( 71(71(5)))- We have the derivation 

go(7i(Ti(«)), 2/1, • • • , 2/r) p{ 7 i{s), ui, . . . , Uib) 

=>M ^[yi ^ Uk], 

hence, by letting ^ = ^[yi ui,...,yjb ^ izjb], we get rjvf (71(71 («))) = 
^[2/1 ^ ^1, • • • , 2/r ^ ^r], regardless of 5. This means that (71(71(5))) does 
not depend on 5, which contradicts tm = Tdoubie- ordered- The contradiction 
arises from the assumption p ^ go- 

Since the roles of 71 and 72 are exchangeable, we have proved Observation 
3 for 72, too. □ 

We summarize the proof so far. The assumption tm = Tdoubie- ordered 
implies that the set Q of (relevant) states of M is the set Q = = {go}, 

and that Ji consists of the rules 

• go(7i(a?i),2/i,-“,2/r) -^go(aJi,i^i,-.-,t^r), 

• 9o(72(iPi), 2/1 , • • • , 2/r ) -► go(«i, Vi, . . . , Vr)y and 

• go(of,2 /i,--, 2/r)--^^, 

where t,ui, . . .,Ur, vi, . . ., Vr G T^(Vi-)- 

In order to derive a contradiction, we transform M into an equivalent 
attributed tree transducer A. Since M is superlinear, M is also super well- 
presented (by Note 6.9). Hence we can apply the construction of Lemma 6.21 
and obtain a one- visit attributed tree transducer A such that tm{s) = 7*^(5), 
for every input tree 5. Let E* be the environment of A. Define 

max = max{height{^) | ^ is a right-hand side of a rule of A} + 
max{height(E^{yj)) | yj is an inherited attribute of A}. 

Analyzing the set of rules of A, we find that contains the rule go(7r) — »• 
go(7rl) and Ry^ contains the rule go(7r) —*■ go(^l)- 

Now consider an input tree 5 with height max + 1 and an arbitrary deriva- 
tion d of A on 5 of the form 



90(e) 'p 

such that ta{s) = <p[yi{e) *- E'{yi), .... j/r(e) <- E'{yr)]. 

Due to the form of the rules, the derivation d starts with height{s) — 1 
applications of the chain rule go(7r) — go(^l) without producing any output 
symbol. Thus, d has the form 
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9o(^) =>A,5 9 o(1) =>A,s 9o(11) • • • =^A,5 ^A,s P* 

Let us consider the occurrence u; in s, at which A produces an output 
symbol for the first time during d. Due to the form of d, this output symbol 
is produced by an inherited attribute on the ascent of A (from the leaf to the 
root of s). 

It is not possible that w = e, i.e,, that A does not produce the first output 
symbol until returning to the root of s again, because otherwise A would have 
to produce the whole output tree by means of the environment. However, 
every tree in the environment is too small to fulfill this requirement. It is 
also not possible that w = 1, because otherwise A would have to produce the 
whole output tree in its last derivation step, followed by the substitution of the 
environment trees. However, this is not possible either, because height{s) = 
height{rA{s)) > max. Thus w ^ {e, 1}. 

Next consider the input tree s' which is obtained from s by changing the 
label of the root of s (i.e., if the root of s is 71, then the root of s' is 72, and 
conversely). 

Clearly, occ{s) = occ(s') and A will also produce the first output for s' at 
occurrence w and this output symbol will be the same as in the derivation 
d (since w ^ {e, 1}). Hence, the roots of r>i(s) and ta(s') will be the same. 
However, the roots of rdoubie-ordered(s) and TdoubU- ordered are not equal. 
Since ta = Tdoubie- ordered^ we have derived a contradiction to the assumption 
that Tdouble— ordered ^ sTJM AC and hence, Tdouble— ordered ^ sl-AIAC. Q 



6.2.3 The inequality l-TOP — sUMAC 0 

We prove the above inequality by giving a linear top-down tree transfor- 
mation, called Tmoduioi and then proving that it cannot be induced by any 
superlinear macro tree transducer. The tree transformation Tmoduio is defined 
as follows. 



Definition 6.30. Let T = (Q, E, A, qqq, R) be the top-down tree transducer, 
where 

• Q = { 900 , 5l0, ^01, 9ll}j 



• R consists of the following 12 rules: 


1) 


900(7(2:1)) -^ 7 i( 9 io(a;i)), 


2) 


9oo(^(a:i)) (901(2:1)), 


3 ) 


910(7(2:1)) 70(900(2:1)), 


4 ) 


910(5(2:1)) ^ 51(911(2:1)), 


5 ) 


901(7(2:1)) -^71(911(2:1)), 


6) 


9 oi( 5 (xi)) — » 50(900(2:1)), 


7 ) 


911(7(2:1)) ^70(901(2:1)), 


8) 


9ii(5(a;i)) 5o(9io(2:i)), 


9)-12) 


Qijiot) 0, for every 0 < 
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By direct inspection, we see that T is linear. It is easy to see that T operates 
as follows: for every input tree s GTs, the output t^(s) can be obtained from 
s by substituting every occurrence of 7 (and S) by 70 or 71 (by or ^i) such 
that the i-th occurrence of 7 (5) counted from the root of s is substituted 
by 7mi (hy Smi)f where m,* = imod2. Moreover, the leaf a of s remains 
unchanged. This can be done in such a way that the number of the processed 
7’s and S^s is encoded in the states of T. In fact, T enters state qij if and only 
if i (and j) is the number of the processed 7’s (and 6’s, respectively) modulo 
2. Hence the initial state is goo- 

For example, compute 'Tr(7(^(5(7(a))))). We get the following derivation 



5oo(7(^(^(7(a))))) =^T 7i(«io(^(^(7(«))))) 

7i(^i(9ii(^(7(«))))) 

=^T 7i(5i(^o(?io(7(a))))) 

=^T 7i(^i(^o(7o(9oo(a))))) 

=>T 7i(^i(^o(7o(«))))- 

Hence we have 77’(7(6(5(7(a))))) = 7i(^i(^o(7o(a))))- 

In the following we denote the tree transformation induced by T by 

'^modulo • ^ 



Theorem 6.31. UTOP — sl-MAC ^ 0. 

Proof. We prove by contradiction that Tmoduio cannot be induced by any 
superlinear macro tree transducer. Since the proof is similar to that of Theo- 
rem 6.29, the details are left to the reader. Refer to Sect. 2.7 for the method 
by which we denote trees over monadic ranked alphabets. 

Assume that there is a superlinear macro tree transducer M = 
(Q, i7. A, go, Ry E) with £* = (ti , . . . , tr ) such that Tmoduio = tm - 
We can show that R consists of the rules 

• «o(7(®i).yi.---.yr) 

• qoi6{xi),yi,...,yr)^qo{xi,vi,...,Vr),md 

• qo(a,yi,---,yr)^t, 

where, for 1 < i,j < r, Ui,Vj E T^{Yr)i and i G T^{Yr). 

We can prove this as follows. First let us consider the (go, 7)-rule. It should 
be clear that rhs(go,7) contains ari. Otherwise, for every tree s G Tr, the 
output Tm ( 7(5)) would not depend on s: this is a contradiction, for details 
see the proof of Theorem 6.29. On the other hand M is linear, hence x\ may 
occur at most once in r/is(go,7). So the form of the (go,7)-rule should be 

qo{j{xi), yi,..., yr) ->■ u{p{xi, «i, . . . , «*)), 

for some u E k > 0,p E and ui, . ■ - ,Uk G T/^{Yr). Next we 

can show that p = go. In fact, if we assume that p ^ go, then, considering the 
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(p, 7)-rule, we obtain that rhs{p, j) cannot contain xi. This is because, xi al- 
ready occurs in rhs{qo, 7) and because M is superlinear. However, if rhs{p, 7) 
does not contain x\, then, for every tree s ETs, the output tm{jj{s)) would 
not depend on s, which is a contradiction again. (Formally the proof follows 
that of Theorem 6.29.) Thus we get that p = qo and that the form of the 
(go,7)-rule should be 

9o(7(®i). - «(go(iCl, • • • , Ur)), 

for some u G (^^^^)* and «i, . . . , «r € Tji{Yr). 

Next we can show that u = 5:. In fact, assuming u ^ e, yte find that 
u contains at least one output symbol from A, Then, considering an input 
tree of the form 7” (a), we obtain that TAf(7^(a)) has the prefix vP, where 
u contains at least one output symbol from A, Since, by tm = Tmoduioi 
height{TM{'y^{c^)) should also be n -f 1, it also holds that u may contain 
at most one symbol. If u contains exactly one symbol, then the fact that 
TAf (7”(<^)) has the prefix u” yields an obvious contradiction to the assump- 
tion Tmoduio = TAf- Hence the only possible case is u = e, which proves 
that the (^o,7)“rule has the form we stated. Since the roles of 7 and 6 are 
exchangeable, the same holds for the (qo, 5)-rule. 

In order to derive a final contradiction, we apply the technique used in the 
proof of Theorem 6.29. That is, we transform M into an equivalent attributed 
tree transducer A = {Att, E, A, ao, R , £"). Then A is one-visit, see the proof 
of Theorem 6.29, and tm(5) = Ta(«), for every input tree s. Let be the 
environment of A. Define again 

max = max{height{^) | ^ is a right-hand side of a rule of A} 

-b max{heighi{E'{yj)) \ yj is an inherited attribute of A}. 

Analyzing the set of rules of A, we find that both Ri^ and contain the 
rule go(^) — ► 9o(^l)- 

Now consider an input tree s with height max + l and an arbitrary deriva- 
tion d of A on s of the form 



90(e) (p 

such that ryi(s) = y>[yi(e) <- yr{e) E'{yr)]. 

Due to the form of the rules, the derivation d starts with height(s) — 
1 applications of the chain rule go(^) —*■ 5o(^l) which produce no output 
symbols. Thus, d has the form 

90(e) go(l) 90(H) . . . <p. 

Let us consider the occurrence w in s, at which A produces an output 
symbol for the first time during d. Due to the form of d, this output symbol 
is produced by an inherited attribute on the ascent of A (from the leaf to the 
root of s). As in the proof of Theorem 6.29, we can see that neither w = e 
nor w = I because if this were the case then A would produce an output of 
height not greater than max for s. 
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Let us change the input tree s to s' by changing the label of the root of 
s (i.e., if the root of s is 7, then the root of s' is and conversely). 

Clearly, occ(s) = occ(s') and A will also produce the first output for s' at 
occurrence w and this output symbol will be the same as in the derivation 
d (since w ^ {e, 1}). Hence, the roots of ta(s) and ta(s^) will be the same. 
However, the roots of Tmoduio{s) and of Tmoduio{s^) are not equal. Since ta = 
Tmoduioi we have derived a contradiction to the assumption that Tmoduio G sl- 
MAC. Hence, Tmoduio ^ sl-MAC. □ 



6.2.4 The inequality sl~TOP — HOM ^ 0 

This inequality has already been proved in Lemma 3.35. 

6.2.5 The inequality sUTOP — YIELD ^ 0 

It is clear that, for every ranked alphabet i7, the identity tree transformation 
Id{Ts) can be computed by a superlinear top-down tree transducer (see 
Lemma 3.38). However, this does not hold for YIELD. 

Lemma 6.32. There is a ranked alphabet E such that Id{Ts) ^ YIELD. 

Proof. Consider the ranked alphabet E = From the definition 

of Y lELD (Def. 4.30), the following property can be derived immediately 
for this particular monadic ranked alphabet E. 

For every r > 0, every function g : E^^^ — ► T^(.^r), and every tuple 
^ ^ the function yield g i induced by g and t has the following 

property: 

for every s eTs, yieldgt(s) = g{a)[zi ^ti,...,Zr U]. 

Hence the range of every function yield g i is a singleton, i.e., a set with 
one element. But Id{Ts) has an infinite range. Thus Id{Ts) ^ YIELD. □ 



6.2.6 The inequality YIELD — bas-MAC ^ 0 

Intuitively, the Y I ELD- functions perform a kind of nested substitution (see 
Def. 4.30). As shown in Lemma 4.32, every YIELD- function can be com- 
puted by an appropriate superlinear macro tree transducer. There it was es- 
sential that, in the right-hand sides of the rules of that tree transducer, nesting 
of states is allowed. Here we prove that, without this capability of nesting (i.e., 
if one considers basic macro tree transducers), not every Y /E'LD-function 
can be computed. 

First we define a tree transformation Texv-ieaves which takes an input tree 
s over the ranked alphabet E = and delivers an output tree of 

which the height is equal to the number of leaves of s. This tree transformation 
can be computed by a Y I E LD- function. 
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Definition 6.33. Let S = and A = be two ranked 

alphabets and let g : Ta{Z\) be the function defined by g{a) = 7 ( 2 ^ 1 ). 

Then denote the tree transformation yieldgj^a) ’T^-^Ta by Texp--ieaves- 

Note 6.34- Texp^leaves £ YIELD. 

In particular, if the input tree 5 is a fully balanced tree over i7, then the 
height of Texp^ieavesis) is + 1. Next we prove that a basic macro 

tree transducer can only increase the height of an input tree linearly. 

Lemma 6.35. Let M be a basic macro tree transducer. There is a constant 
c > 0 such that for every s eT^, 

height{TM{s)) < c • height{s). 

Proof. Define c = max {height {^) 1^ = f*hs{q,<T),q G G E}. Next define 
the two statements K and L. 

K: For every s G T^, n > 0, g G height {TM,q{s)) < c • height{s). 

L: For every k,n > 0, ^ G RHS{Q, A/k,n), and u G 

(Tjs;)*, height{TM,^{(^)) < height{^) -h c • mx, where mx = 

max [height (si ), . . . , height(sk)}- 

We can prove K and L by using the proof by simultaneous induction, see 
Principle 3.11. 

IB: Let ^ G RHS{Q,A,Q,n) and uj = (). The proof is by structural 
induction on 

(i) ^ cannot have the form 

(ii) Let ^ = 5(^1, . . for some 6 G A^^^ and ^i, . . G T^iVn). Assume 
that L is true for ^i, . . . Then we can compute as follows: 

height 

= height{6{TM,^i ‘ , tm,^i ( w ))) 

= 1 -f max{height{TM,^i{iJi^)) 1 1 < * < /} 

< 1 + max [height {^i) -h c • max \ l < i < 1} 

(by induction hypothesis on 

= 1 + c • mx -h max [height {^i) 1 1 < « < /} 

= height{^) + c • mx. 

(iii) Let ^ = yj. Then we can compute: 

height{TM,^{(^)) 

= height{yj) 

= 1 

< height{^) -f c • mx. 
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ISl: Let s = a{si , . . . , 5jb) E n > 0, g G Then we can compute: 

height{TM,q{s)) 

< height(rhs{q, a)) + c • mx (by L) 

< c + c • mx 

= c • height{s). 

IS2: Let ^ G RHS{Q, fe, n) be the right-hand side of the rule of a basic 
macro tree transducer. Let uj G (T 27 )*'. The proof is by structural induction 
on 

(i) Let ^ = p{xi,^i , ... , 6 ) for some p G with / > 0 and ^ 1 , . . . ,^/ E 

RHS{Q, A, fc, n). Because M is basic, every G T^{Yn), i.e., it does not 
contain a state. We distinguish two cases: / = 0 and / > 0. 

/ = 0 : 



height {tm,^{oj)) 

= height {TM,p{si)) 

< c • height(si) (by K) 

< height{^) + c • mx. 

l>0: 



height{TM,^{uj)) 

= height{TM,p{si)[yj 4 - 1 < j < /]) 

< height{TM,p{si)) + max {height 1 1 < i < 0 

< c • height{si) -f max{height{l^j) \ l < j < 1} (W K) 

< c • mx + max [height {^j) \l < j < /} 

< 1 + max{height{^j) |1 < i < /} + c • mx 
= height{p{xi , 6 , • • • , 6)) + ^ 

(ii) Let ^ = 6 (^ 1 , . . . , 6 ) for some 6 G A^^^ and ^ , . . . , E T^(yn)* The proof 
is the same as in IB. 

(iii) Let ^ = 2 /j . The proof is the same as in IB. □ 

Lemma 6.36. YIELD — bas-MAC ^ 0. 

Proof. By Note 6.34, Texp^Uaves G YIELD. Assume that Texp^ieaves G has- 
MAC. Then, by Lemma 6.35, there is a constant c such that, for every 
s £ height{TM{s)) < c • height{s). But if s is a fully binary tree, then 
height{Texp-ieaves{s)) = + 1 which is a contradiction. Hence, 

'^exp— leaves ^ bas-JH AC . ^ 

So far a number of statements of the form r ^ C have been proved in 
this book, where r is some tree transformation and C is some class of tree 
transformations. In most of the proofs the relationship between the height or 
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size of the input tree and the corresponding output tree has been used. Here 
we would like to prove such a statement by means of another technique: the 
statement is 

there is a tree transformation, called rnyck, which is not in bas-MAC 

and the proof technique is based on considering the paths of output trees. 

First let us define the tree transformation TDyck- It transforms trees over 
the input alphabet E = into trees over the output al- 
phabet A = b'^^\ thus, in particular, rj^yck computes 

monadic trees. Intuitively, the transformation TDyck can be understood as 
follows (see Fig. 6.7 and the remark after the following definition). Consider 
an input tree s. Now traverse s depth-first right-to-left. Whenever the symbol 
7i is crossed in the direction “root to frontier” , then an a is produced as an 
output symbol; if it is crossed in the direction from “frontier to root”, then 
an a' is produced as an output symbol. Whenever the symbol 72 is crossed, 
then, similarly, either b or 6' is produced as an output symbol depending on 
whether the crossing direction is from “root to frontier” or from “frontier 
to root”. Formally, we define r^yck by means of a superlinear macro tree 
transducer. 

Definition 6.37. Let E = and A = b^^\ 

be two ranked alphabets. Define the tree transformation TDyck • 
Te — Ta by TDyck = tm where M = (Q^E^A^q^R^E) is the superlinear 
macro tree transducer given hy Q = = (c), and R contains the 

rules: 

1 ) q{x2,q{xi,yi)) 

2 ) q{7i{xi),yi) a{q{xi,a'{yi))) 

3 ) g(T2(a?i),2/i) ^ 6(g(xi,6'(2/i))) 

4 ) g(a,yi) -^2/1 ° 

Since sl-MAC C ATT, the transformation TDyck can also be specified by 
an attributed tree transducer. This approach is taken in Fig. 6.7 where the 
relation 



rD»cfc(Ti(<T(T2(72(a)),7i(<^(72(a),a))))) = aabb'a'bbh'b'a'c 



is shown. 

Now we show that TDyck is not in bas-MAC. The non-inclusion is due to 
the following reasons 

• the path languages of range{bas-M AC) are linear (context-free) languages 
and 

• the path language of range{TDyck) is the Dyck language D2 over {a, 6} and 
it is known that D2 is not a linear language. 
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Fig. 6.7. An illustration of TDyck 



We need some terminology about path languages. Let A he a, ranked 
alphabet. The path alphabet associated with Zi, denoted by pA^ is the finite 
set {(5,0) I S G U {(5,2*) | 6 G A^^^ for some fc > 1 and 1 < i < 

k}. For every ranked alphabet Zi, the path translation of A is the function 
path A • Ta V{{pAy) defined inductively as follows, (i) For a G A^^\ 
define pat/i^(a) = {(a,0)}. (ii) For t = 6{ti,...,tk) with 6 G A^^^ and 
fc > 1, define = {(5, i)p \ 1 < i < k and p G pathAiU)}. Let PATH 

denote the class of all path translations path Ay where ^ is a ranked alphabet. 
For a class C of languages, we use PATH{C) to denote the class of all path 
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languages path{L) for L E C. Note that, for every t G T^, path^(t) C (pA)*. 
Recall that £/,„ denotes the class of linear languages. 

At the same time we will also show that the path languages of ranges of 
top-down tree transformations are regular languages. Recall that Creg denotes 
the class of regular languages. 

Theorem 6.38. 

1 ) PATH{range{has-MAC)) C Cun- 

2 ) PATH{range{TOP)) C Creg- 

Proof. Consider a basic macro tree transducer M = {Q, U, A,qo, R, E) with 
rank{qo) = r -h 1, r > 0, and £* = (ti , . . . , <r)- We construct a linear context- 
free grammar G such that L{G) = path^{range{rM))- 
Define G = {N,pA, P, qnew) as follows. 

• N = {qnew} u {[g, 2*] I ^ G n > 0, 0 < z < n} 

• P contains the rule qnew [^o,0] and, for every i G {1, . . .,r} and w G 

pathAiU), the rule qnew [qoJ] w. 

Moreover, consider the rule q{(r{xi , . . . , xjk), t/i , . . . , t/n) ^ of P and con- 
sider a path (<5 i,/ci)((52,«2)...(6;,,k^) in pathAuQuXkUYniO- 

- If, for some i with 1 < i < p, the element 6 i is a state p G Q and 6 ^ Ajb , 
then the following rule is in P: 

[q, Ti] -H. {Si , /Ci)(62, /C2) . . . {Si-i, Ki - l](6i+i, /Ci+i) . . . 

...{ 6 ^^l,Kf,^l)tp 

where rj and are determined as follows: 

• if G A, then rj = 0 and V’ = 

• a 6 ^ = yj , then rj = j and 'ip = e. 

- If 5^-1 = p E Q and 6 ^ E Xk^ then the rule 

[9,0] ^ (^1,Ki)(^2,K2)---(^p-2,K,*-2)[P.O] 



is in P. 

- If, for every i with 1 < 2 < p, the element Si E A \JY , then the rule 

[q,ri\ (6 i,Ki)( 52,«2 )--.(^/.-i,«m-i)i/’ 

is in P where rj and rp are defined as in the first case. 

This ends the construction of the linear grammar. If M is a top-down tree 
transducer, then G is clearly a regular grammar because the first of the three 
conditions above never holds. 

Now it remains to prove the correctness of the construction. Let mx = 
max{n | ^ 0}. 

For every q E Q, i > I, and w E (pA)* , we can prove the following 
statements. 

1. [q^i] w iff there is an s G Ts such that u;(yi,0) G 

pathAuYmx{'^M,g{s)). 
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2. [g, 0] w iff there is an s G Ts such that w G po>thAuYmx{'^M,q{s)). □ 



In fact, the classes Cun and PATH(range{has-MAC)) are very close to 
each other: by dropping the second component of the elements of the path al- 
phabet, languages in Cun can be considered as elements of PATH{range{bas- 
MAC)). 

Example 6,39, Consider again the basic macro tree transducer M which was 
shown in Theorem 6.5. We repeat its rules: 

• ^ a{q{xi,l{yi)),q(xi,2{yi))) 

• ^!/i- 

Recall that the environment £* of M is the sequence (#). 

According to Theorem 6.38, we construct the linear context-free grammar 
G — {N,pA, P, qnew) as follows. 

• N = {qnew} C {[g, 0], [g, 1]} 

• P contains the rules 
~ qnew ^ [? ) 0] 

qnew 0) 

- the right-hand sides of rules of M have the following paths 

1. ((7,l)(g,l)(xi,0) 

2. ((7,l)(g,2)(l,l)(t/i,0) 

3. ((T,2)(g,l)(xi,0) 

4. ((7,2)(g,2)(2,l)(2/i,0) 

5. (yi,0) 

These paths give rise to the following rules in P: 

1. [«,0]-+((T,l)[g,0] 

2. [g,l]-(<T,l)[9,l](l,l) 

3. [g,0]-^((T,2)[g,0] 

4. [g,l]-(<^,2)[g,l](2,l) 

5. [g, !]-♦£. 

We find that the state [g, 0] never occurs in a derivation qnew =>g ^ 
some w G (pA)*, because every rule with [g, 0] in its left-hand side has a state 
in its right-hand side. □ 

Definition 6.40. The Dyck language D 2 is the formal language which is 
generated by the following context-free grammar G = ({A}, {a, a', 6, 6'}, P, A) 
where P contains the productions: 

• A -^AA 

• A aAa! 

• A ^ hAV 

% A-^e □ 

Lemma 6.41. ^ H\in- 
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Proof. It can be proved by a pumping lemma for linear languages (see 
for example Theorem 6.1 of [ABB97]) that the language L • L with L = 
{a” 6” I n > 0} is not a linear language. Now assume that £>2 G £/»n- Let R = 

| fc, /, m, n > 0} be a regular language and let h : {a, a', b, 6'}* 

{a, 6}* be the homomorphism which maps the letters a, a', b, V to the letters 

а, b,a,b, respectively. Clearly, L • L = h{D 2 O R). Since the class of linear 

languages is closed under intersection with regular languages and homomor- 
phisms (see for example Section 3.1 of [MS97]), it follows that L • L G Cun 
which is a contradition. □ 

Lemma 6.42. There is a string homomorphism h such that the Dyck lan- 
guage £>2 is the image of path/^{range{rDyck)) under h. 

Proof. Let A = be the ranked alphabet of Def. 

б. 37. Define the string homomorphism h : pA* — ► {a, a', 6, 6'}* by h{{a, 1)) = 

а, h((a', 1)) = a', h{{b, 1)) = 6, h{{V , 1)) = 6', and ft((c,0)) = e. 

Consider the macro tree transducer M = {Q, E, A, q, R, E) of Def. 6.37. 
For every s £ Tj;, the one and only leaf of TM,q{s) is labeled by t/i. Thus, 
the path language path^{range{rM)) can be generated by the context-free 
grammar G = (N,pA,P,qo) with N = {qo^q} and P contains the following 
productions. 

1) (c, 0) 

2 ) q-^qq 

3) g (a,l) q (a',1) 

4) g (6, 1) q (6', 1) 

5) q-^e 

Roughly speaking, the productions of G are obtained by taking the paths of 
the right-hand sides of rules of M; the first production takes over the role of 
the environment of M in the definition of tm- Hence, pathA{range{TM)) = 
£(G). It is clear that h(£(G)) = D 2 . □ 

Theorem 6.43. rnyck ^ bas-MAC. 

Proof. Assume TDyck 6 bas-MAC. Then, by Theorem 6.38, 
pathA{range{rDyck)) G £/»n- Since Cun is closed under homomorphisms, 
(according to Section 3.1 of [MS97]), it follows from Lemma 6.42 that 
D 2 G Clin which is a contradiction to Lemma 6.41. □ 

б. 2.7 The inequality ATT - lu-ATT 0 

We prove this inequality by showing a pumping lemma for tree transforma- 
tions induced by monadic one- visit attributed tree transducers and by con- 
structing an attributed tree transducer which does not meet the statement 
of the pumping lemma. 
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Definition 6.44. An attributed tree transducer is called monadic, if the 
ranked alphabets of input symbols and output symbols are monadic. 

The attributed tree transducers considered have the additional property 
that, for every input tree s, they always include the leaf of s in their deriva- 
tions and return to the root. 

Definition 6.45. Let A = (Att, E, A, ao, R, E) be a monadic one-visit at- 
tributed tree transducer and let s 

1) A is called leaf touching on s, if there is a synthesized attribute a and 

a l)-context v E Ca,i such that ao(£:) i.e., the 

derivation starting from ao{€) calls an attribute occurrence of the leaf of s. 

2) A is called root touching on s, if there is an inherited attribute b and 

a {A, l)-context v E Ca,i such that ao(e) , ^[^(^)]> derivation 

starting from ao(e) calls another attribute occurrence of the root of s. □ 

In the following we use the particular ranked alphabet Eq = 

, and we consider the language Ls = {7<5”(a) | n > 0}. 
Clearly, in order to compute an infinite output language, the attributed 
tree transducer should be leaf touching. 

Lemma 6.46. Let A be a monadic one- visit attributed tree transducer with 
input alphabet Eq. If there is an input tree Sq E such that A is not leaf 
touching on sq, then the set ta(L^) is finite. 

Proof. Let Sq be an input tree such that A is not leaf touching on sq . Due to 
the forms of Eq and Ls, for every tree s E L$ with height{s) > height(so), 
there is an m > 0 such that s has the form so[o ^ i"*(a)]. Since A is not 
leaf touching on A cannot distinguish between sq and any so[a 6"*(a)] 
with m > 1. Hence, for every m > 1, ta(5o) = Ta(sq[c^ ^ Since 

there is only a finite number of trees s in with height{s) < height{so), 
the language ta{Ls) is finite. □ 

We can also transform every leaf touching attributed tree transducer into 
an equivalent leaf touching and root touching attributed tree transducer. 

Lemma 6.47. Let A be a monadic one- visit attributed tree transducer with 
input alphabet Eq. Moreover, A has the property that, for every s E 
A is leaf touching on s. Then there is a monadic one- visit attributed tree 
transducer A' such that ta = ta> and, for every s E A' is leaf touching 
and root touching on s. 

Proof. Let A = (Att, Eq, A,oq, R, E) be a monadic one-visit attributed 
tree transducer with the properties mentioned. Construct the attributed tree 
transducer A' = {Att\ Eq, A, qq, R‘ , E^) as follows: 

• Att' — Attsyn U Att^^f^ where Att^^f^ = {upp \ (3 E A^^^} U AtUnh- 
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• for every b G Attinhy define E\b) = £'(6) and for every /3 G define 
E'{upp) = p. 

• R is equal to R except for the following: 

- If there is a rule in Ra of the form a(7r) ^ v(/?) for some a G Attayn, 

V G and /? G then replace this rule by a(7r) — ► vupp{ir), 

- Rflf there is a rule in Rs of the form 6(7 t1) — ► v{/3) for some b G AtUnhi 

V G and p G then replace this rule by the rule 6(7 t1) — ^ 

vt/p^(7r). 

Moreover, for every /? G the rule up^(Trl) — up^{'K) is in R'^, 

- R^ :li there is a rule in R^ of the form 6(7 t1) ^ v{!3) for some 6 G AtUnhj 

V G and P G then replace this rule by the rule 6(7 t1) 

vupp{ir). Moreover, for every P G add the rule up^(Trl) — »► up^(7r) 
to ily. 

It is obvious that, for every input tree s G A' is leaf- touching and 
root-touching on s and that ta = ta*- □ 

Now we prove the pumping lemma for tree transformations induced by 
monadic one- visit attributed tree transducers. 

Lemma 6.48. Let A be a monadic one- visit attributed tree transducer with 
input alphabet Eq- Moreover, A has the property that, for every $ E Ls, A 
is leaf touching and root touching on s. 

There is a constant no, such that, for every n > no and input tree j 6 ^(a), 
there are «i, /C 2 € and K 3 G and there are vi, ^ 2 , ^ 3, ^4 G 

(^(1))* and V 5 G and 

• 'yS^{a) = K 1 K 2 K 3 , TA{j 6 ^{oi)) = viV2^^3^^4^5 and K 2 ^ e 

• for every k >0, ta{kiK 2 K 3 ) = vii;2*^t;3V4*'v5- 

Proof. Let A = {Ati, Eq, A,ao, R, E) be a monadic one- visit attributed 
tree transducer with the properties mentioned. Define no = card{Attayn) • 
card{Attinh)- 

Now consider the derivation of A on the input tree s = yS^{a) for 
some n > no. Since A is leaf touching and root touching on s, there are 

G 6 V € € 

Attinh, and wi, . . .,Wn € and there is a 6 G AtUnh and a u; € 

such that ao = ai„ and 

^ Ay$ ^ia»j(l) 

=>A,s Ul W2 0.j(ll) 



=^A.> «1 «2 • • • «n+l a.„+, 

=>A,, «1«2 

=»>!.. « 1«2 ...U„+iVWibj,{l’^) 



=i^A,$ «1«2 ...«n+lV«>l- .U'»tj„(l) 

A . u^ Un .. V UH ... w h( fr\. 
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The fact that A is one- visit is used in the form of the above derivation in 
the following way. There are no inherited attributes appearing in the deriva- 
tion before an inherited attribute of the leaf a is involved. In fact, if there 
was at any point, then a synthesized attribute should appear later so that 
the derivation touches the leaf of s. However, in this case an inherited at- 
tribute of 6 would depend on one of its synthesized attributes, contradicting 
the requirement that A is one- visit. Moreover, once an inherited attribute 
of ot is reached, a synthesized attribute no longer appears in the derivation. 
This is because if it did, then an inherited attribute of 6 would depend on 
one of its synthesized attributes, again contradicting the requirement that A 
is one- visit. 

Since n > no = card(Attsyn) * card(Attinh)i there are / and /' such that 
1 </</'< n and a,-, = a,j, and • Let Iq and Iq be indices 

such that = ai^, and . Let these indexes be fixed for 

the rest of the proof. 

Define ki = and K3 = Then obviously, 

j6^(a) = K1K2K3. Since Iq < 1^2 ^ 



Define 






Vi 


II 


3 




V 2 


= «Jo+l • • • 


uv 

‘0 




V 3 


= «/i+l • • • 


Un+1 . 




V 4 


= «'n-J(,+2 


. . . U>n— /o+l 






= l«n-/o+2 


. . . WnWWE 


where we = E{b) 


Since 


«»io - ^^—'1,+* 


= i^in-io+i » ^Le piece «2 of the input tree s 



can be repeated without changing the derivation of A on the pieces k\ and 
/C3. This means that, for every A: > 0 and Sk = the derivation of A 

on Sk looks as follows: 

«o(e) 

=^A,>h ^2 *^3 W4 

Thus, for every fc > 0 , ta{kiK 2K3) = vi V3 v* V5. □ 

Lemma 6 . 49 . ATT ~ Iv-ATT ^ 0 . 

Proof. Consider the following attributed tree transducer A = 
{Ati^EQ^A^aQ^R^E) with Eq as before, A = Attsyn = 

{do, Sa, «c}j and Attinh = The rules of A are as follows: 

• Ay = {ao(7r) — ► Sa(7Tl), n(7rl) Sc{ttI)} U {sa(7r) -H. a, ScC^r) — ► oc} 

• Rs = {sa(7r) — ► aSo(7rl), ScC^r) cScC^rl), ib{T^l) — ► bn{Tr)} U {ao(7r) — a} 
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• Ra = {so(7t) -+ ij(jr), Sc(7t) -h- a} U {ao(7r) -+ a}. 

We note that the rules appearing in the second members of the unions defining 
Rj,Rs, and Ra are just dummy rules because they will never be used in any 
derivation starting from the attribute occurrence ao{e) of an input tree. 

Then certainly A is noncircular, but A is not one- visit because of the 
rule u(7t 1) — The tree transformation induced by A on Ls has the 
property that 

TA{yS^(a)) = a”6”c^(a) 

for every n > 0. 

Now assume that ta G Iv-ATT. Since ta{L 6) is infinite, by Lemmas 6.46 
and 6.47 we can assume that A is leaf- touching and root- touching on every 
input tree. Then, by Lemma 6.48 there is an no with certain properties. 
Consider the input tree s = yd^{a) for some n > no. Then, by the pumping 
lemma, there are /ci,/C2)«3 and there are v\,V2,vz^V4 G and G 

such that y6^{a) = k\K 2 Kz, r{y6^{a)) = i;ii; 2 t; 3 t; 4 t ;5 and, for 
every A: > 0, t{kiK 2 Ks) = i;it;2*V3V4*V5. 

Clearly, when the input tree s is pumped by repeating K2, the input tree 
grows (note that «2 ^ ^)- Hence, because of r^(75”(a)) = a”6”c”(a), the 
output will also grow. And thus, V2 or V4 cannot be empty and they occur 
more often in the output of a pumped input tree than in the output of s. On 
the other hand, neither V2 nor V4 can contain more than one of the letters 

а, b, and c, otherwise these letters would not be in alphabetical order in the 

output of the pumped input tree. Then the output cannot be of the form 
d^b^c^{a), because the number of no more than two letters can change in 
it. This is a contradiction to the assumption that ta G Iv-ATT however the 
Ac’s and v’s are chosen. □ 

б. 2.8 The inequality nn-ATT - TOP ^ 0 

Here we follow the same line of proof as in Theorem 6.43. We construct 
a nonnested attributed tree transducer A such that the path language of 
range{TA) is a linear language. Then difference set is not empty because the 
path languages of ranges of top-down tree transducers are regular languages. 

Definition 6.50. Let S = and A = { 7 p^ 72 ^\ De- 

fine the tree transformation Trepeat ' Tj; Ta by Trepeat = Ta where 
A = {Ati,E,A,a,R,E) is the nonnested attributed tree transducer given 
by Alt = Attsyn U Aitinh with Attgyn = {a} and AtUnh = E{b) = a, 
and R consists of the sets R^ and Ra. 

H,: 

1) o(7t) -.• 7i(a(7rl)) 

2 ) 6 ( 7 t 1 ) J2{b{ir)) 



and Rn,: 
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3) a{ir) — ► 6(7t) 

It is obvious that ta = T 1 T 2 (<^)) I ^ > 0}* ^ 

Note 6.51. By standard arguments, {( 71 , 1 )”( 72 , 0) I ^ > 0} ^ Creg- 

Theorem 6.52. Trepeat ^ TOP. 

Proof. Assume Trepeat G TOP. Then, by Theorem 6.38 part 2 , L = 
pathA{range{Trepeat)) G Creg- However, by Note 6.51, L ^ Creg contradicting 
the assumption. □ 

6.2.9 The inequality HOM - Z-TOP ^ 0 

This inequality has already been proved in Lemma 3.33. 



6.3 Composition Semigroup Generated by TOP, ATT, 
and MAC 

The content of this section 

In this section we consider again the problem of determining whether an inclu- 
sion (or equality or incomparability) relation holds between any two expres- 
sions built up by composition from a fixed set H of fundamental tree trans- 
formation classes. The set H now will be chosen as H = {TOP, ATT, MAC}. 

We recall that the same problem has been considered in Sect. 3.7, where 
H consisted of certain top-down tree transformation classes. We also recall 
that in paragraphs (a)-(d) of that section we gave a general method with 
which such an algorithm can be constructed. We proved in Lemma 3.48 that 
the method is correct. Then we successfully implemented this method for the 
instance of H mentioned. 

In this section we will apply the general method to the set 

H = {TOP, ATT, MAC} 

thereby constructing an algorithm for H. 

We recall from Sect. 3.7 that, to develop the algorithm, we considered 
two semigroups in terms of H: the free semigroup with the operation of 
concatenation (denoted by •) and the semigroup 

[H] = [Ui o • • • o Um \ m > l,Ui E H for every 1 < i < m} 

generated by H with the operation of composition (denoted by o). We also 
used the homomorphism \ \ : [H] which is the unique extension of the 

identity mapping over H and for which, therefore, 

for all elements Ui,...,Um G H. Moreover, we denoted the kernel of the 
homomorphism | | by 0 . 
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Another application of the general method 

In this subsection we apply the general method to the set H = 
{TOP, ATT, MAC}. In Sect. 3.7 it transpired that the implementation could 
be performed in five steps. Here we also split the implementation into five 
corresponding steps. 

STEP 1. In the first step we produce a string rewrite system S, which is our 
candidate for presenting [H]. In the present case, let S consist of the seven 
rules: 

1) TOP • TOP TOP 

2) TOP • MAC MAC 

3) MAC • TOP MAC 

4) TOP • ATT MAC 

5) ATT • TOP ATT 

6) ATT • MAC ATT • ATT 

7) MAC -ATT ^ MAC MAC 

Roughly speaking, the rewrite rules do not change the class of tree trans- 
formations to which they are applied. Formally, this is expressed in the fol- 
lowing lemma. 

Lemma 6.53. 9 

Proof. First, in the same way as in the proof of Lemma 3.49, we show that 
elements of S are valid in [H], i.e., that, for every u v £ S, the equality 
|u| = |v| holds. Therefore we recall that the validity of the rules 1), 2), 3), 
4) and 5) has already been verified in Theorem 3.39, Corollary 4.38, and 
Theorems 4.40, 6.25, and 5.47, respectively. With the following computation, 
we show that MAC o ATT = MAC o MAC, which establishes 7). 

MAC o ATT 
C MACoMAC 
C MACoTOPoYIELD 
C MACoYIELD 
C MAC o sl-MAC 
C MAC o ATT 

Next we prove that ATT o MAC 



c 


ATT 0 MAC 

ATT o HOM o sl-MAC 


(Corollary 4.39) 


c 


ATT oTOPo sl-MAC 


(Corollary 3.32) 


c 


ATT o sl-MAC 


(Lemma 5.46) 


c 


ATT o ATT 


(Note 6.9 and Theorem 6.23) 


c 


ATT o MAC 


(Lemma 6.1). 



(Lemma 6.1) 

(Lemma 4.34) 

(Theorem 4.40) 

(Lemma 4.32) 

(Note 6.9 and Theorem 6.23). 

= ATT o ATT, i.e., that 6) is valid. 
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Finally we note that the inclusion 6 can be derived from the validity 
of the rules in S in the same way as in the proof of Lemma 3.49. □ 

STEP 2. Next we give a set REP which is our candidate for a set of 
representatives of the congruence classes of O5. Let 

REP = {TOP} U {ATT^ I n > 1} U {MAC^ I n > 1}. 

STEP 3. In this step, we construct the inclusion diagram of the tree 
transformation classes represented by the elements of REP, i.e., that of the 
set \REP\ = {|u| I u E REP}. Our candidate for this inclusion diagram can 
be seen in Fig. 6.8. We observe that the guessed diagram is now rather special, 
in fact it is a chain. Therefore we do not have to follow the method presented 
in Sect. 2.2 to prove that it is the inclusion diagram of \REP\. Instead we 
observe that there are no incomparable classes and therefore it is sufficient 
to prove that all inclusions shown by the diagram are proper. 

Lemma 6.54. Figure 6.8 shows the inclusion diagram of the set \REP\ = 
{\u\\ueREP}. 

Proof. In Parts 1, 2, and 3, we show that all inclusions shown by the con- 
jectured diagram are proper. 

Pari 1. Here we recall that the inclusion TOP C ATT has already been 
proved in Theorem 5.44. 

Part 2. We now prove that ATT^ C MAC^, for every n > 1. 

We note that this statement has already been proved in Lemma 6.2 for 
n = 1. We now prove it for an arbitrary n > 1. 

In Lemma 6.1, it was shown that ATT C MAC. Hence ATT^ C MAC^ 
also holds. We still need to prove that the inclusion is proper. We perform 
the proof by showing a tree transformation which is in MAC^ but not in 
ATT^. Consider the macro tree transducer defined in Lemma 4.23, which 
induces the tree transformation r = {(7*'(a),7^ (a)) \k > 0}. Then the 
tree transformation 



i.e., the n-fold composition of r is in MAC^, for every n > 1 (recall the 
definition of exp in Lemma 4.24). Thus, for every A: > 0, heigkt{rn{'y^ {a))) 
= si2e(r„(7*(a))) = exp{n,k) + 1. 

Assume that Tn E ATT^, i.e., that, for every 1 < ^ < n, there is an 
attributed tree transducer A{ = i7», i7,+i, a,*, such that E\ = 

■Sn+i = and 



T4, O...OTA„=Tn. 

Now, for every I < i < n, let c,- be the constant associated with the at- 
tributed tree transducer A,* by Lemma 5.40 such that, for every s E Tj;., 




6.3 Composition Semigroup Generated by TOP, ATT, and MAC 211 





ATT* 



MAC’*+^ 



MAC^ 




MAC^ 



MAC 



TOP 



Fig. 6,8. Composition semigroup generated by {TOP^ATTyMAC} 



height{rAi{s)) < C{ •size{s). Moreover, let di be the constant associated with 
the ranked alphabet Si by Lemma 2.13. Then, for every 1 < i < n and 
s € Tsi, size{s) < holds. 

Then, for every fe > 0, we can approximate height{Tn(y^ (a))) by c,’s and 
di ’s as follows. Since Tn = ^ o . . . o , there are trees si G , . . . , Sn+i ^ 

Tr^+i such that y^{a) = si, rAi(si) = «2, • • • , rAn(«n) = Sn+i and also 
rn(7*^(a)) = «n+i- Consequently, by Lemmas 5.40 and 2.13, height(si^i) < 
Ci -size{si) and size{si) < for 1 < i < n. Hence, for rn( 7 *(o:)) we 

get the approximation 
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height{Tn{j'‘ (ce))) < Cn • dn“"‘ 

(Note that size{si) = size{j^{a)) = fc + 1.) Since height{Tn{'y^ {a))) = 
exp{n, A:) + 1, we obtain 



exp{n, Ar) + 1 < Cn • , 

for every A: > 0. However, this is an obvious contradiction. 

Part 3. We prove that, for every n > 1, the inclusion MAC'^ C 
holds. 

The inclusion MAC^ C can be seen as follows. For n = 1, we 

have 



MAC 

= TOP o YIELD (by Theorem 4.37) 

C ATT o ATT (by Lemma 5.44 and Corollary 6.24). 

Then, by induction on n we obtain 



C 

C 

c 

c 



MAC'^ o MAC 

o mac 

o top o yield 

o yield 



(by induction hypothesis) 
(by Theorem 4.37) 

(by Theorem 5.47) 

(by Corollary 6.24). 



We still have to prove that the inclusion is proper. 

Therefore, recall from Example 3.29 the definition of a fully balanced 
binary tree over E = We will construct an attributed tree trans- 

ducer A = {Atty Ey Ay o, Ry E) such that, for every A; > 1, rA(sfc) = S 2 k where 
Sk denotes the fully balanced binary tree of height k. Let Att contain the 
inherited attribute 6 and the synthesized attribute Gy E = A = 

E{b) = a, and R contains the following sets of rules: 



• Ra: 

1) a(7r) — ► (7(a(7r2), a(7r2)) 

2) 6(7t1) — 6(7t) 

3) 6(7t 2) — )> a(7rl) 

• Ra: 



4) a(7r) ^ c'(6(7t), 6(7t)) 

Then we can prove the following statement by an easy induction on k: for 
every Ar > 1, 

a(e) S2fc[a<-6(e)]. 
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We note that, in the induction step, we have used the fact that, for every 
Jb',ik"> 1, 

Sjb/[a ^ Sfc//[a <(— i>(£^)]] = Ski-i+k"[oi 
Now denote by r. 

Then it follows from the construction of A that, for every fc > 1, 
heighi{r{sk)) = exp{n + 1, k). 

Next we prove by contradiction that r is not an element of the class 
MAC^. Thus assume that r G MAC^. Then by Lemma 4.24, there is a 
constant c > 0 such that, for every s the approximation 

height{r{s)) < exp(n^c • height{s)) 

holds. Hence we obtain that, for every > 1, 

height{r{sk)) < exp{n^ c • k). 

Combining this approximation with the calculation of height{r{sk)) 
above, we obtain, for every Ar > 1, 

exp{n + 1, fc) < exp{7ij c • k). 

Obviously, this is a contradiction and hence, r ^ MAC^^ by which we prove 
our lemma. □ 

STEP 4- In this step we show that, from every element of , we can compute 
the corresponding representative. 



Lemma 6.55. There is an algorithm that computes, for every w G a 
representative u G REP such that w =>g u. 

Proof. We prove the lemma by induction on length{w). 

Let length{w) = 1, implying that w £ H. Then we also have w G REP^ 
by the definition of REP. Hence, for u = w,we have u G REP and w u. 

Now suppose that length(w) > 1, which means that w = x • U, for some 
X G and U E H. Moreover, by induction hypothesis on length{w)^ a 
representative v G REP can effectively be computed such that x v. 
Then, regarding v, three cases are possible, each of which has three subcases 
concerning U. For each subcase, we compute u. 

Case 1: v = TOP. Then w = x • U ^ ' U = TOP • U. 

• Subcase U = TOP. Then we have u = TOPy because TOP • U = TOP • 
TOP =>s TOPy by rule 1). 

• Subcase U = ATT. Then u = MACy because TOP • U = TOP • ATT =>s 
M AC y by rule 4). 
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• Subcase U = MAC. Then u = MAC, because TOP-U = TOP -MAC =>5 
MAC, by rule 2). 

Case 2: v — ATT^, for some n > 1. Then w = x U =>^5 v • = ATT^ • U. 

• Subcase U = TOP. Then we have u — ATT^, because ATT^ • U = ATT^ • 
TOP =^s ATT^, by rule 5). 

• Subcase U = ATT. Then u = because ATT^-U = ATT^-ATT = 

ATT”+i. 

• Subcase C/ = Then u = because ATT” • U = ATT^ • 

MAC =>5 by rule 6). 

Case 3:v = MAC^, for some n > 1. Then w = x-U =>5 v = MAC^-U 

• Subcase i7 = TOP. Then we have u — MAC^, because MAC^ • U = 
MAC^ • TOP =>5 MAC^, by rule 3). 

• Subcase U = ATT. Then u — MAC^'^^, because MAC'^ • U = MAC^ • 
ATT =>5 MAO^+^ by rule 7). 

• Subcase P = MAC. Then u = MAO”+^ because MAC” • C/ = MAC” • 
MAC = MAC”+^ 

We have finished the proof of our lemma. □ 

STEP 5. We have now constructed the desired algorithm. This is declared 
in the following theorem, which can be proved just as Theorem 3.54 was 
proved in Sect. 3.7. Therefore we omit the proof here. 

Theorem 6.56. Let H = {TOP, ATT, MAC}. Then there is an algorithm 
that decides, given two arbitrary tree transformation classes U\o- • -oUm and 
V\ o • • • oVn^ where Ui, Vj G H, for every 1 < i < m and 1 < i < n, which 
one of the conditions 



(i) 


Uio-- 


■oUm 


= 


Vio- 


■oVn, 


(ii) 


Uio-- 


■oUm 


c 


ViO- 


■ov;, 


(iii) 


Vio- 


■0V„ 


c 


Uio- 


■ ■ ® Um 1 


(iv) 


Uio-- 


•oUm 


N 


Vio- 


■oV„. 



holds. 

An application of the algorithm 

In this subsection, as an example, we apply the algorithm for two tree trans- 
formation classes. For example, consider the two tree transformation classes 

ATT o TOP o MAC o ATT and TOP o MAC o ATT o TOP. 

By Lemma 3.48 we first apply the algorithm described in Lemma 6.55 to 
compute an element u G REP such that ATT • TOP • MAC • ATT =>-5 u. 
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As in Sect. 3.7, we can construct a convenient table for applying the 
algorithm in Lemma 6.55. For H = {TOP, ATT, MAC}, the table looks as 
follows. 





TOP 


ATT 


MAC 


TOP 


TOP 


MAC 


MAC 


ATT' 








imxmrsm 




ESSSill 


ESSSiil 



We recall that the table, denoted by T, has the following properties. Its rows 
are labeled by elements of REP and its columns are labeled by elements of H. 
(In fact, REP is now an infinite set, hence the table should have an infinite 
number of rows. However, all elements of the form ATT^ (resp. MAC'^) of 
REP, where n > 1 can be handled similarly, therefore two rows are sufficient 
to represent the infinite number of cases.) 

As we saw in Section 3.7, for v G REP and U E H, the entry T{v, U) at 
(v, U) is also an element of REP such that v • U T{v, U). 

Again, if there is a t/; = i/i • i /2 • * , the following small program 

computes u G REP, for which w u. 

u := Ui] 

for f = 2 to n do 

u:= T{u,Ui) 

Applying the program to u; = ATT • TOP • MAC • ATT, we obtain 
u; = ATT • TOP • MAC • ATT 
which implies that 



ATT o TOP o MAC o ATT = ATT^. 

In a similar way we obtain 

TOP • MAC • ATT • TOP MAC^, 

implying that 

TOP o MAC o ATT o TOP = MAC^, 

From the inclusion diagram in Lemma 6.54, we can see that MAC^ C 
ATT^. Hence we have also obtained 

TOP o MAC o ATT o TOP C ATT o TOP o MAC o ATT. 

Presenting [H] by a terminating and confluent rewrite system 

In the same way as in Sect. 3.7, we can show that the string rewrite sys- 
tem S we have given is also terminating and confiuent. Moreover, we have 
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chosen REP in such a way that REP = IRR{Hy S) holds, i.e., that the irre- 
ducible elements of if"*" with respect to 5 are exactly the representatives we 
gave. This makes it possible to compute more easily, for every w G the 
representative u G REP, such that w u. 

Lemma 6.57. REP = IRR{H,S) 

Proof. Recall that REP = {TOP} U {ATT^ |-» > 1} U [MAC^ I n > 1}. 
The fact that REP C IRR(H, S) is obvious, because none of our seven 
rules can be applied to TOP, ATT^, or MAC'^, n > 1. The converse 
inclusion IRR{H,S) C REP can easily be seen also. Let us denote the 
set of irreducible elements of length n with respect to S by IRRn, for 
n > 1. It is sufficient to show that IRRn C REP, for every n > 1. In 
fact, IRRi = {TOP, ATT, MAC}, hence we have IRR\ C REP. Moreover, 
IRRn = {ATT^,MAC^}, for every n > 2 . This can be seen as follows. On 
the one hand, ATT^ and MAC^ are clearly irreducible. On the other hand, 
take a string w G with length{w) = n which differs both from ATT^ 
and MAC'^. We show that w cannot be irreducible with respect to S. We 
observe that w is either TOP^ or it contains at least two different symbols 
from H. The string TOP^ cannot be irreducible because rule 1) is applicable 
to it. Furthermore, if there are two different symbols X and Y 'm w, then 
there are also two different adjacent symbols X' and Y' in w , i.e., X* Y' is a 
substring of w. However, in this case we can easily check that there is a rule 
among rules 2)-7) with left-hand side X‘ • Y' irrespective of X' and Y'. Then 
w cannot be irreducible, because this rule applies to it. Thus, IRRn C REP 
for n > 2 also. □ 

Next we show that S is terminating and confluent. 

Lemma 6.58. S is terminating and confluent. 

Proof. First we prove that S is terminating. We observe that rules of S 
have the property of not increasing length which means that, for each rule 
u V G 5, we have length{u) > length{v). Hence, for every derivation 
w X, also length{w) > length{x). This implies the equivalence between 
the statements 

• there exists an infinite sequence wq =^5 wi =>5 W 2 - • • with W{ G for 
i = 1 , 2 ,... (i.e., that S is not terminating) 

• there exists a string x E H* such that x x. 

Next we observe that if x x, for some x G , then none of the rules 
l)-5) could be applied during that derivation, because they are strict length- 
reducing rules and thus any application of them would yield a decrease of the 
length of the string to which they were applied. Hence x x if and only if 
X X by applying only rules 6 ) and 7), which are 
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6) ATT • MAC ATT • ATT and 

7) MAC • ATT MAC • MAC. 

Let a; = xi • TOP • a?2 * TOP • . . . • TOP • Xm, for some m > 1 and 
a?i,...,aryn G {ATT, MAC}*. Then we obtain that x x if and only if 
there is an i with 1 < i < m such that x,- ^ J x,- . 

Therefore we can conclude that S is not terminating, if and only if there 
exists a string x G {ATT, MAC}*, such that x x, by applying only rules 
6) and 7). (Note that the latter condition is superfluous because rules l)-5) 
are not even applicable to x.) In other words, S is terminating if and only if 
such an x does not exist. We will prove that there is no x with this property. 

To this end, we deflne, for every string x G P*, the weight weight{x), 
which is a natural number. Roughly speaking, weight(x) is the sum of the 
position numbers of the signs which are between an ATT and a MAC or 
between a MAC and an ATT, i.e., which are either in the situation . . . ATT* 
MAC ... or in the situation . . . MAC • ATT ... in x, where the number of the 
position is considered from right to left. More formally, we deflne weight{x) 
by induction as follows. 

(i) If X = e, then weight{x) = 0. 

(ii) If X = MAC • x', for some x' G {ATT, MAC}*, then 



weight{x) = | 



weight{x*) + length{x^) if the first letter of x' is ATT 
weight{x') otherwise 



(iii) Moreover, if x = ATT • x', for some x' G {ATT, MAC}*, then 



weight{x) = | 



weight{x*) -f length(x^) if the first letter of x' is MAC 
weighi{x^) otherwise 



For example, weight{e) = 0, weight(ATT) = 0, weight{ATT • ATT) = 
0, weight{ATT • MAC) = 1 and weight{ATT • MAC • ATT • MAC) = 6. 

Finally we state that weight is defined in such a way that any application 
of rules 6) and 7) to a string x G {ATT, MAC}* reduces the weight of x, i.e., 
if X ^5 2/ by applying either 6) or 7), then we have weight{x) > weight{y). 
This, of course, proves that x ^ J x for no x G {ATT, MAC}* and hence 
that S is terminating. 

We must still verify that both 6) and 7) reduce weight{x), for every 
X G {ATT, MAC}* . Assume that we have x =^5 y by applying 6). This means 
that there are x', 2/ G {ATT, MAC}* such that x = x' • ATT • MAC • 2/ and 
2/ = • ATT • ATT • 2/. Then u;eiflfht(2/) = iyei5fh^(x) — 1 if the first symbol of 

2/ is MAC and u;e2flfht(2/) = u;e25f/i^(x) — (/en^t/i(2/') + l) otherwise. In either 
case, weighi{x) > weight{y). A similar reasoning can be applied for rule 7). 
This completes the proof that S is terminating. 

We now sketch how to prove that S is confluent as well. As in Lemma 
3.53, we can prove that REP is a set of representatives for <=>5 which means 
that every congruence class of *0^5 contains exactly one element of REP. 
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On the other hand, we have seen that REP = IRR{Hj S) implying that 
every congruence class of contains exactly one irreducible element for S. 
However, by Lemma 2.6, this exactly means that S is confluent. 

This finishes the proof of our lemma. □ 

The importance of Lemmas 6.57 and 6.58 is that, when computing the 
representative in REP for an arbitrary string w £ we do not have to 
use the algorithm described in Lemma 6.55. 

Instead, we can do the following. Given a string w G we compute an 
irreducible element u G IRR{H, 5), so that w u. Such a u is always reach- 
able, because S is terminating. On the other hand every derivation starting 
from w reaches the same irreducible element u, because S is confluent. This 
irreducible u will be in REP, because IRR{H, S) = REP. 

For example, consider again the tree transformation classes 

ATT o TOP o MAC o ATT and TOP o MAC o ATT o TOP. 



For w = 


ATT • TOP ■ MAC ■ ATT, we 


can compute u G IRR{H, S) by 




ATT • TOP ■ MAC ■ ATT 




=>s 


ATT ■ MAC ■ ATT 


(by rule 2) 


=>5 


ATT ■ MAC ■ MAC 


(by rule 7) 


=>S 


ATT ■ ATT ■ MAC 


(by rule 6) 


=^S 


ATT ■ ATT ■ ATT 


(by rule 6). 


But also 


ATT ■ TOP • MAC ■ ATT 




=>s 


ATT • MAC ■ ATT 


(by rule 5) 


=>s 


ATT ■ ATT • ATT 


(by rule 6). 


Similarly, for w = TOP MAC ATT TOP, we have w MAC -MAC. 



Since 



MAC o MAC C ATT o ATT o ATT, 
we again obtain the inclusion 

TOP o MAC o ATT o TOP C ATT o TOP o MAC o ATT. 
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In Chaps. 4 and 5 we investigated two formal models of syntax-directed se- 
mantics which are able to handle context, the macro tree transducer and the 
attributed tree transducer, respectively. Whereas the first one handles con- 
text in an implicit way by allowing the meaning names (i.e., states) to have 
parameters, the second one deals with context information in an explicit way 
by introducing context names (i.e., inherited attributes). 

In this chapter we study a formal model, called the macro attributed tree 
transducer, which integrates both approaches. From the point of view of a 
macro tree transducer, a macro attributed tree transducer is not restricted 
to recursive calls to states on input trees in a strictly descending way, but an 
arbitrary tree walk is allowed (as for attributed tree transducers). From the 
point of view of attributed tree transducers, a macro attributed tree trans- 
ducer contains attributes, which are not restricted to denote basic values, i.e., 
trees, but which may denote functions over trees (as the states of macro tree 
transducers can do). The connection between these transducers is shown in 
Fig. 1.33. 

Because of the possibility of arbitrary tree-walking over the input tree, 
macro attributed tree transducers share with attributed tree transducers the 
property of a possibly non-terminating derivation relation. Hence, we also 
have to discuss here the topics of circularity and a circularity test. 

Since the growth of output trees with respect to the corresponding input 
trees can be deduced easily from a decomposition result of macro attributed 
tree transducers, we change the order of the corresponding sections appro- 
priately. The structure of this chapter is as follows. 

1. Basic definitions 

2. Induced tree transformation 

3. Characterization of macro attributed tree transformations 

4. Composition and decomposition results 

5. Height property 

6. Bibliographic notes 
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7.1 Basic Definitions 

Since a macro attributed tree transducer should be able to tree-walk on its 
input trees, we define this model in the same framework as the attributed 
tree transducer. That means that we use the notion of path variable and the 
separation of the input tree from the sentential form (see Fig. 5.1(b)). 

Definition 7.1. Let As and Ai be two disjoint ranked alphabets, ^ be a 
ranked alphabet such that A H {As U A») = 0, and fc,n > 0 be integers. 
The set RHS{As^Ai^A^k,n) of right-hand sides over A,, Ai, A, k, and n 
is the smallest subset RHS of ^1) -i U Yn) satisfying the 

following conditions. 

(i) For every a G with / > 0, 1 < i < fc, and ^i, . . E RHS, the 

term a(7T2, , . . . , 6) is in RHS, 

(ii) For every b G with / > 0 and ^ RHS, the term 

6(7t, ^ 1 , . . . , ^/) is in RHS, 

(iii) For every 6 G A^^^ with / > 0 and ^i, . . . , G RHS, the term 6{^i ,,, ,,(i) 
is in RHS, 

(iv) Yn C RHS, □ 

The definition of a macro attributed tree transducer follows the outline 
of the definition of an attributed tree transducer. 

Definition 7.2. A macro attributed tree transducer is a tuple N = 
{Att, U, A, ao, R, E), where 

• Att is a ranked alphabet, called the set of attributes; every element in 
Att has at least rank 1; Att is partitioned into the disjoint sets Attsyn and 
Attinhj of which the elements are called synthesized attributes and inherited 
attributes, respectively. 

• E and A are ranked alphabets such that Att fl (i7 U ^i) = 0, called the 
input alphabet and the output alphabet, respectively. 

• ao G Att%V‘^ for some r > 0 is a designated attribute, called the initial 
attribute, 

• ii is a set U<r6i? finite sets Ra of rules which satisfies the conditions: 

(1) For every a G Atti^t^^ with n > 0 and a G with k >0, the set Ra 
contains exactly one rule of the form 

a(^,2/i,- -,2/n) 

where ^ G RH S{Attsym AtUnh, A, k, n), 

(2) For every 6 G Att\^^^^ with n>0,crE E^^^ with k > I, and I <i < k, 
the set Ra contains exactly one rule of the form 

b{wi,yi,,.,,yn)-^^ 

where ^ G RH S{Attsym AtUnhi A, k, n). 
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• E = (Ei^E^) is the environment^ where E\ = G and 

E 2 : Attinh Ta{Y) is a function such that for every b G with 

> 0, E2{b) G T^(n). □ 

Example 7.3. As example, we compute, for a given input tree s over 
the sum of the squares of the lengths of the paths of s by means 
of a macro attributed tree transducer Ngsq- More precisely, for a given input 
tree s, we compute the number 

^ (length{w))^ 

w^occ(s)Alabel{s 

As usual, we represent natural numbers by monadic trees over the ranked 
alphabet 

First, we need an attribute which calculates the sum of the numbers 
{length{w)Y for every leaf w. Let us call this synthesized attribute sum; it 
has two rules which are elements of Ra and Ra, respectively: 

• Rffi 

— sum(7T,2/i) — ► sum(7rl, sum(7r2, 2 / 1 )) 

• Ra'. 

- sum{ir,yi) — ► squ{ir,yi) 

If the synthesized attribute sum arrives at a leaf labeled by a, then the 
inherited attribute squ is called which calculates the square of the length of 
the path from this leaf to the root of the input tree. Here the equation 

= (2n — 1) + (n — 1)^ 

is used which is implemented by using an inherited attribute double; it keeps 
and updates the numbers (2n — 1). There are the following rules for squ and 
double where we have omitted as usual parentheses around the unary 7: 

• Rffi 

- squ{7rl, yi) double{ir, squ{TT, t/i)) 

- sgu(7r2,2/i) dou6/e(7r, sgu(7r, 1/1)) 

— dou6/e(7rl, 2/1) — > yjdouble^w^yi) 

— dou6/e(7r2, 2/1) —► yydouble{TT, 2/1) 

Thus, the complete macro attributed tree transducer 
Nssq = {Ati, 17, Aj sum, R, E) looks as follows. 

• Att = Attsyn u Attinh with 

Attgyn = {sum^^^} and Attinh = {squ^^\ double^^^} 

• i:={(7(2),aW} 

• A = 

• iZ<7: 
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- sum{ir, j/i) — ► sum{irl, sum{ir2, yi)) 

- squ{irlyyi) double{Tr^squ{7r,yi)) 

- squ{7r2y y\) — ► dotx6/e(7r, squ{TT, t/i)) 

- doti6/e(7rl,t/i) yydouble{7r ^yi) 

- double{ir2j t/i) jydouble(7r , y\) 

Rd‘ 

- sum{7Ty yi) squ{ir, yi) 

• E = (£^i, E 2 ) with El = (a) and E 2 {squ) = yi and E 2 {double) = 7(2/1). 

After having defined the derivation relation of macro attributed tree trans- 
ducers we will also show a computation of □ 

Note 7,4- 1) A macro attributed tree transducer in which the set Att of 
attributes is unary, is an attributed tree transducer and vice versa. 

2) A macro attributed tree transducer in which there are no inherited 
attributes (i.e., AtUnh = 0), is very close to a macro tree transducer: synthe- 
sized attributes correspond to states; a rule a(7r, 2/ij • • • , 2/n) — ► ^ in iia with 
<7 G corresponds to the rule a{<T{xi , . . . , a?jb), 2/1 , • • • > 2 /n) — where is 

obtained from ^ by replacing every subterm ...) by a'(xi, ...). □ 



Similarly to attributed tree transducers, we fix the following notions and 

notations for a macro attributed tree transducer N = (Att, E,A^ao,R,E): 

• In the rules of iJ, the variable tt is called path variable. 

• The notions of set of inside attribute occurrences of (7, set of outside at- 
tribute occurrences of (7, and set of attribute occurrences of a are defined 
in the same way as for attributed tree transducers. In particular, the fact 
that an attribute can have an arbitrary rank greater than 1 is ignored. 

• The right-hand side of the rule a(7r, 2 /i» • • • > 2 /n) — ^ ^ in Ra is abbreviated 

by rhs(a(7r), a) and the right-hand side of the rule b{m, 2/1, • » 2 /n) ^ ^ in 

Ra is abbreviated by rhs{b{m)^a). For <7 G we denote by RHS{a) 
the set of right-hand sides of rules in R^, i.e., RHS{a) = {rhs{c{TrTj),a) | 
c(7rr/) G in{<r)}. 



7.2 Induced Tree Transformation 

For the rest of this chapter, let N = (Att,E^Ayao,R,E) be an arbitrary, 
but fixed, macro attributed tree transducer. Next we define the derivation 
relation of a macro attributed tree transducer N on an input tree s (see Figs. 
7.1 and 7.2). As in Def. 5.5, the occurrences of s are considered as nullary 
symbols. 

Definition 7.5. Let s £ Ts. The derivation relation induced by N on s is 
the binary relation =>n,s over the set TAttuAuocc(8){y) such that for every 
<p,^€ TAuuAuoee(^>)(Y) we define tp =>jv,» rp, if 
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Fig. 7.1. A derivation step induced by N and triggered by a synthesized attribute 



• there is d> context j3 ^ ^Attu^iuocc( 5 )ui^,i 

• there is an attribute c G with n > 0 

• there is a path w G occ(s) 

• there are trees E TAuuAuocc(s)(y) 

such that <p = jS[c(w, y?i, . . . , (pn)] and one of the following conditions holds. 

(1) c G Atisyny label{s, w) = a E with Ar > 0, and 

^ ^ ^1, • • • , yn ^ ^n]] where ^ = rfts(c(7r), a). 

(2) c G Aitinhy ^ for some v G occ(s), label{s,v) = <r, (t G with 
A: > 1, 1 < i < fc, and 

^ v][yi ^ ^ 1 , . . . , 2/„ ^ ^n]], where ^ = r/is(c(7ri), (t). 

Example 7.6. Consider the macro attributed tree transducer Ngsq of Exam- 
ple 7.3 and the input tree s = o'((7(q', a), a). In the following derivation we 
abbreviate =^jvr ,,,,5 to =>. 
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Fig. 7.2. A derivation step induced by N and triggered by an inherited attribute 
sum{e,yi) 

=>► sum{ly 5wm(2, t/i)) 

=> st/m(ll, sum{l2, sum{2^ 2/i)))* 

Next we compute sum(2,yi) 

sum( 2 , 2 /i) 

=>► squ{2,yi) 

=» double{e, squ{e, yi)) 

and we call this result ti. Then we compute stim(12,ti). 

sum(12,ti) 

=> squ(12,ti) 

doubie(ly squ(lyti)) 

^ doub/e(ljdouble(ey squ(Syti))) 
yydouble{Sy double{Sy squ{£y ti))) 

and we call the result < 2 . Finally we compute sitm( 11,^2). 






7.2 Induced Tree Transformation 227 



sum{llyt2) 

=> squ{llyt2) 

double{l,squ{ljt2)) 
double{l, double{e, squ{e, ^2))) 

=> yjdouble{e, double{€, squ{e^ ^2))) 

Thus overall Ngsq has derived the output tree 

jjdouble{e, double{Sy squ{Sj yydouble{e, double{e, squ{e, 
double{e, squ{e, yi)))))))). 

By means of the environment of iV, the inherited attributes at the root 
of the input tree will be interpreted (in Def. 7.23); roughly speaking, double 
will be replaced by one more 7 and squ is simply dropped. Then the result 
of this interpretation is the tree 7^ (or) which is the desired result, because 

= 2^ + 2^ + 1^ = 9. 

□ 

Definition 7.7. Let s be an input tree of N, The set of sentential forms of 
N and s, denoted by SF{N^ s), is the smallest subset SF of TAttuAuocc(s)(y) 
for which the following conditions hold. 

(i) For every c G Att^^'^^^ with / > 0, ix; G occ(s), and (pi,..,,(pi G SF, the 
term c(ti;, tpi , . . . , G SF. 

(ii) For every 6 G with / > 0 and (pi,...,(pi e SF, the term 6{(pi ,...,(pi) 
is in SF. 

(hi) Y CSF. □ 

In the following, it is also useful to have a notation for the particular 
set of sentential forms available. These sentential forms occur at the end of 
derivations and hence, they are called final sentential forms. They consist of 
output symbols, parameters, and inherited attributes applied to the occur- 
rence e. In fact later it transpires that it is useful to restrict the set of possible 
parameters. 

Definition 7.8. Let U C y be a set of parameters. The set of final sentential 
forms of N and U, denoted by SFjin[N,U), is the smallest subset SF of 
'^Atiinhy^Aij{e}{U) such that the following conditions hold. 

(i) For every 6 G Att^^^^^ with / > 0 and y>i, . . . , y?/ G SF, b{e, <pi, . . . ,(pi) e 
SF. 

(ii) For every 6 G with / > 0 and cpi,. . .,cpi G SF, the term 6{(pi ,...,(pi) 
is in SF. 

(hi) U CSF. □ 

As in the case of the first three formal models of syntax-directed se- 
mantics, there is a strong relationship between right-hand sides of rules and 
sentential forms fsee Lemmas 3.7. 4.8. and 5.8L 
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Lemma 7.9. Let fc,n > 0, ^ G RHS{Attsyn,Attinhi^,k,n), s G Ts, w G 
occ{s) such that, for every 1 < i < fc, the occurrence wi is in occ(s), and let 
(pi...,(p„€ SF(N, s). Then ^[tt <- ty][j/i <- , y„ <- v>„] € SF{N, s). O 

Here, as for attributed tree transducers, we cannot state a representation 
of every sentential form as the result of a substitution performed on some 
right-hand side of a rule. Again this is due to the missing neighborhood 
relation between the paths which are involved in an arbitrary sentential form; 
however the relation is necessary for such a representation (see the discussion 
following Lemma 5.8). 

Lemma 7.10. Let s be an input tree of N. Then SF{N,s) is closed under 

Proof. The proof of the lemma is by structural induction on sentential forms 
in SF{N^s). It is very similar to the proof of the corresponding lemma for 
attributed tree transducers (Lemma 5.9). 

□ 

Clearly, since macro attributed tree transducers are extensions of at- 
tributed tree transducers, they share the possibility of cyclic derivations. 

Definition 7.11. 1. TV is circular^ if 

• there is an s G Tj;, 

• there is a sentential form c{w, ti, . . . , ^p) G SF{N, s) with c G 
p > Oj w E occ(s), and i^i, . . . , tp G 

9 there is a context ^ G 

• there are , . . . , y?p G SF{N^ s), 

such that c{w, ^[c{w, fp)]. 

2. N is noncirculary if it is not circular. □ 



Our definition of circularity is pessimistic in the sense that even a macro 
attributed tree transducer is quoted as circular, if the attribute which is 
responsible for this property occurs in a parameter position of another at- 
tribute and the latter attribute ignores this parameter (“partialness followed 
by deletion”). We illustrate this phenomenon by an example. 

Example 7,12, Let AT be a macro attributed tree transducer such that the 
set R of rules contains the following subsets for the input symbols 7 ^^^ and 
aW. 

• R^\ 

- ao(7T, yi) ao(7rl, a(7rl)) 

- a(7r) — ► a 

- 6(7t 1) a(7rl) 

• Ra, 
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- ao(^,J/i) ^ a 

— a(7r) — ► b{ir) 

Note that rhs{ao{w)^ 7) does not contain yi. N is circular, because for the 
input tree s = 7(a), N can derive: 

a(l) =>N,8 K^) =>N,s a(l) 

which is a cycle. 

However, for every input tree s, i.e., s = 7” (a) for some n > 0, there is a 
terminating derivation: 





ao(e,yi) 


=>N,$ 


ao(l,a(l)) 


=>N,t 


ao(ll,a(ll)) 


=>N,s 


ao'(l”,a(l”)) 


=^N,s 


a 



This pessimistic point of view is incorporated in the definitions of the 
dependency graphs in which no distinction is made as to whether an attribute 
occurs nested in a parameter position or not. 

Definition 7.13. Let a G with k > 0. The dependency graph of a 
for Ny denoted by Disr{a)y is the directed graph {att{(T)y E{a)) with E{(t) = 
{((c, 7T7/), (c', ttt;')) G out{a) x in{a) \ rhs(c'(7rr;'), a) contains a subtree of the 
form c(7T77, , . . . , as a set of arcs. □ 

The dependency graph of an input tree s (Def. 5.13), the composition of a 
dependency graph with is-graphs of input trees (Def. 5.15), and the is-graph 
of s for N (Def. 5.15) are defined in the same way as for attributed tree 
transducers. Moreover, the lemma on which we can base the circularity test 
for macro attributed tree transducers and the test itself can be taken from the 
case of attributed tree transducers (Lemma 5.17 and Fig. 5.7, respectively). 
Hence, we do not repeat these definitions, lemmas, and the algorithm here. 

In the rest of this chapter the arbitrary, but fixed, macro attributed tree 
transducer N is assumed to be noncircular. Then we can prove that, for every 
input tree s, the derivation relation is confluent and terminating. 

Lemma 7.14. For every input tree s of Ny the derivation relation is 

locally confluent on SF{NyS). 

Proof. The proof is very similar to the proof of the corresponding property 
of the derivation relation of macro tree transducers (Lemma 4.10) and hence, 
it is not repeated here. □ 
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As in the case of attributed tree transducers, we define the notions of 
w-reducible sentential form and minimal cj-reducible sentential form (Def. 
5.21). The following observation about the existence of minimal w-reducible 
subtrees corresponds to Observation 5.22. 

Observation 7.15. Every a;-reducible sentential form has a minimal u;- 
reducible subtree. □ 

Since attributes may have a rank greater than 1, minimal cj-reducible 
sentential forms of macro attributed tree transducers appear more general 
than the corresponding forms of attributed tree transducers (Observation 
5.23). 

Observation 7.16. Every minimal cj-reducible sentential form has the form 
• • • >6) for some attribute c, occurrence w, and sentential forms 
^ 1 , . . . , moreover, the , . . . , are not cj-reducible. 

□ 

Observation 7.17. If y? is a minimal a;-reducible sentential form, then there 
is an infinite derivation which starts from (p by application of a rule to the 
root of (p. □ 

The proof of the fact that is terminating proceeds in almost the same 
way as the proof of the corresponding property of attributed tree transduc- 
ers (Lemma 5.24). Recall that we are only dealing with noncircular macro 
attributed tree transducers. 

Lemma 7.18. Let s be an input tree of N. Then =>n $ is terminating on 
SF{N,s), 

Proof. Again we prove this fact by contradiction. Let N he a, noncircu- 
lar macro attributed tree transducer and assume that there is an s E Ts 
such that ^N,8 is not terminating on 5F(iV, s). Then there is an a;-reducible 
sentential form (p E SF{N^s), By Observation 7.15, <p has a minimal uj- 
reducible subtree, say v?o,o, and, by Observation 7.16, ^o,o has the form 
co(^^o,^o,i, • • • ,^o,Zo)- Hence, there is an infinite derivation 



y ? 0,0 =^ 7 V ,5 ^ 0,1 ^ 0,2 • • • 



Clearly, ^o,i is also an az-reducible sentential form and it has a minimal uj- 
reducible subtree, say Moreover, there is an infinite derivation 



^ 1,0 => N ,8 ^ 1,1 =^ N ,8 y ? l ,2 • • • 



Again, (pi^i is az-reducible and it has a minimal a;-reducible subtree ^ 0 , 2 • 
Obviously, this construction of minimal subtrees can be repeated and thereby 
we obtain two infinite sequences (y?»,o)*>o and (^,*,i),>o such that, for every 
2 > 0, the following three properties hold: 
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1. (pi^Q has the form for some attribute c,*, some occur- 
rence Wi of s, and some sentential forms and 

are not cj-reducible. 

2 . ^*,0 ^ n ,8 ^*, 1 * 

3. v?*+i,o is a minimal a;-reducible subtree of (pi^\. 

This implies that, for every i, j >0 with j > i, there is an {Att \JA\J occ{s) U 
y, l)-context /?, j such that 



¥><.0 ^N,, A jbj.o]. 

Since every (pi^o has the form stnd since the set {c(u;) | c G 

Att^w G occ(s)} is finite, there must be indices io,jo with jo > io such that 
Cio = Cjo and WiQ = Wj^. Hence, there is a {Att U A \J occ{s) U Y, l)-context 
P such that 

^*0,0 = ^N,s • • *)^io,bo)] 

— /^[^»o(^»o ) ^io)l > • • • >^io,b'o)]* 

Since no rule in this derivation is applied to the trees in the parameter posi- 
tions of ^,* 0,0 (because is minimal cj-reducible), it follows that for every 
^1 J • • • ) ^/io ^ Ta there are , . . . , such that 

C.0 /?[c.o (U^.o , 6 , • • • > )] • 

This is a contradiction to the assumption that N is noncircular. □ 



Lemma 7.19. For every s £ the derivation relation =>n,s is confluent 
on SF{N,s). 

Proof. This statement follows immediately from the facts that =>jv ,5 is lo- 
cally confluent and terminating (Lemmas 7.14 and 7.18) and from Newman’s 
lemma (Lemma 2.5). □ 



Lemma 7.20. Let s G For every p G SF{N,s), the normal form 
nf{=>N,s,^) exists. 

Proof. The existence of the term n/(=>jv^ 5 , ^) is guaranteed by the fact that 
=^N,8 is confluent and terminating on SF{N, s) (Lemmas 7.18 and 7.19) and 
by Corollary 2.7. □ 

Corollary 7.21. For every s E c E Att^^'^^^ with n > 0, u; G occ(s), 
and pi,..,,pn G SF{N,s), 

nf{^N,s,c{w,pi,,,.,pn)) = 

nf{^N,s , c{w, nf{=^N,s , ^i), . . . , n/(=>jv,5 , ^n))) 
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Proof. (See the proof of Lemma 4.14) By Lemma 7.20, 

,(fi), for every 1 < i < n. Hence, c(iy, ^i, . . . , y?n) ^n,s n/(=^^Ar ,5 
, y>i), . . . , nf{=>N^s , (pn))^ Thus our corollary follows from Corollary 2.8. □ 

The notion of the condensed dependency graph of an input tree s is defined 
in the same way as for attributed tree transducers (Def. 5.29). Hence, the 
definition of topological sorting of inside attribute occurrences in Def. 5.30 
applies to the situation of a macro attributed tree transducer. This topological 
sorting is used in the proof that the normal forms of sentential forms are final 
sentential forms. 

Theorem 7.22. For every p G 5F(AT, s), the inclusion 

nf{^N,s,<p)eSFfir^{N,Y) 



holds. 

Proof. By simultaneous induction (using Principle 5.28), we can prove that 
the following predicates K and L hold on Tjj. 

K: For every a G Aitsyt^^ with n > 0, and s the inclusion 

nf{^N,s , a{e, yi, . . . , Vn)) G SFfin{N, Y) holds. 

L: For every A; > 0 and a G C G RHS{a)^ and (si, . . . , sjb) G (Ts)^ ^ 
the inclusion ^ ^]) ^ SFfin(N,Y) holds. 

Since we already have the proof for the corresponding theorem for macro 
tree transducers (Theorem 4.15) and for attributed tree transducers (Lemma 
5.32) in detail, we omit the proof of K and L here. □ 

Now we can define the tree transformation induced by N. 

Definition 7.23. Let E = (F?i,J? 2 ) be the environment of N with Ei = 

(^i,...,^r). 

(a) Let a G with n > 0 be a synthesized attribute. The tree 

transformation induced by N with a is the function : Ts — ► 

SFfin{N,Yn) which is defined, for every s eTjj.hy = nf{=^N,s 

(b) The tree transformation induced by N is the function : Tjj 

such that, for every s £ Ts, t^^{s) = E{Tfff^^{s)) where E : 
SFfin(N^Yr) — ^ is defined by structural induction as follows. 

(i) For every b e / > 0 and y-i, . . G U 

Yr), 

E{b{e, (pi,..., <pi)) = ^ 2 (b)[yj E(<pj); 1 < i < /]. 

(ii) For every 6 G with I > 0 and <Pi,...,(pi€ TAttMOA{{e} U ^r), 

E{S{tpi,...,<pi)) = 6(E{<pi),...,E{(p,)). 
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(iii) For every yj G Yr, E{yj) =tj. □ 

A tree transformation r is called a macro attributed tree transformation^ 
if T can be induced by some macro attributed tree transducer. The class of 
macro attributed tree transformations is denoted by MAT. 



7.3 Characterization of Macro Attributed Tree 
Transformations 

As for macro attributed tree transformations we provide an inductive charac- 
terization. Clearly, this combines somehow the characterizations of Sects. 4.3 
and 5.3. We use the principle of definition by simultaneous induction (Princi- 
ple 5.36) to define sets of functions which characterize the derivation relation 
of macro attributed tree transducers. In fact, this definition can be seen as 
an amalgamation of Defs. 4.18 and 5.37. We also refer to the definition of 
topsort in Def. 5.30. 

Definition 7.24. Let < be a linear order on Alt. We define the set 
F<.n = : Ts SF,in{N,Y) \ a € Att.y„} 

of functions and the set 

G<,n = : {Tsf ^ SFji„{N,Y) \k>0,<re G RHS{ct)} 

of functions by simultaneous induction according to Principle 5.36 as follows. 

Application of IB: For every a G and ^ G RHS{(t), we define 
<7,^(0) structural induction on 

(i) Since a has rank 0, no right-hand side of a (r-rule contains a synthesized 
attribute and hence, case (i) of Def. 7.1 cannot occur. 

(ii) If ^ for some b € ^ ^ 0, and € 

RH S{Attsym Attinhi 0, n), then we define 

= Ke, (0). • • • . (()))• 

(hi) If ^ = 5(^1,..., 6) for some 6 G with / > 0 and ^ 

RH S{Att8yn, Attinhj A, 0, n), then we define 

. . . , Cl (()))• 

(iv) If ^ = yj, then = yj. 
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Application of IS 1: For every a G with n > 0 and s = 

cr(si , . . . , Sjk) E Ti; , we define 






where a; = (si, , . .,5jfc). 



Application of IS2: Let Ar > 1, cr G and 5i, . . . , sjfc G Ts- Let w = 
iopsori{N, <, 5 ) with s = <t{si , . . . , s^). By (finite) mathematical induction 
on z/ with 1 < z/ < length{w), we define where lj = 



Induction base z/ = 1: The value T^^^a,rhs{w(i),(r)(^^ defined by struc- 
tural induction on rhs{w{l),a). Let SUB represent sub{rhs{w{l),(r)). 

(i) Let d(7ri, , . . . , E 5C/B for some d G with / > 0 and 1 < i < 

Ar, and e SUB. Since z/ = 1, the value E in 

particular; this value does not contain inherited attributes. Thus we can 
define 



_tnd 

^<,Ar,<7,d(7T*,^i 






—ind 



(‘^); 1 < i < ^] 



(ii) Let b{n, £ SU B for some b € with / > 0 and 1 < i < Ar, 

and ^ 1 , . . . , G SU B. Then define 



^<"W, <7, 6(7T,«1,. 



u) = b{u), rip^N,c,( 



.M.-- 



—ind 






(iii) Let 5(^i, . . .,6) ^ SUB for some 6 G ^ > 0, ^i, . . . ,^z G SUB. Then 
define 



co(‘") = 

(iv) Let yj G 5C/J5. Then define = %• 



Induction step z/ — » z/ + 1: Assume that for every « with 1 < « < z/, 
the value a,rhs(w(K),a)i^) already defined. Then we define the value 
a rhs{w(i/+i),(T)i^) 1^^ Structural induction on rhs{w{u -h 1), (t). Let SU B 
abbreviate the set stz6(r/is(ti;(z/ -h 1), cr)). 

(i) Let d(7ri, , 6) E SU B for some d G Aiii^yn with I > 0 and 1 < i < Ar, 
and , . . . , 6 ^ SU B. Then define 



-ind 

'<,N,(7,d{niy^i 
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where ^ represents (^i , . . . , 6 ) stnd (j)^ . -is defined by structural induction 
on trees in SFfin(N, Yi) as follows (where Yi) is represented by 

5): 

(a) Let b{e, <pi^. . (p\) E S for some h G with / > 0 and 1 < ^ < fc, 

and v^i , . . . , V?/ G 5. Then define 

(b) Let 6 (^ 1 , . . . , y?/) G 5 for some 6 G / > 0, • • • j ^ S'. Then 

define 

(c) Let pj G 5. Then define 

(ii),(iii),(iv) These cases are the same as the corresponding cases of the in- 
duction base. □ 



As in Lemma 5.38, the order < does not matter here. 

Lemma 7.25. Let <i and <2 be two linear orders on Att. Then 

• for every a € Att^yn, ^ZuN.a = 

• for every k > 0, (t € S^’‘K and ^ € RHS{(t), r^"fAr.< 7 ,? = ° 

Thus, in the following, we will omit the linear order on the set of attributes 
from the notation of the r*”^-functions. 

Again we show the correctness and completeness of the r*”^-function with 
respect to the derivation relation. For this purpose we define, for every <t G 
with jfe > 0 and ^ G RHS{a), the function ^ : {Tsf SFfin{N, Y) 
such that j((si, . ..,«*)) = 

Theorem 7.26. 1 . For every s £ Ts and a G with n > 0, 

= rfPM 

2. For every i > 0, <r G ( G RHS{(t), and w = («i, . . . , sj,) G (T^Y , 

Proof. The two statements are proved by the established technique of si- 
multaneous induction using Principle 5.28. An example of a proof according 
to this principle is shown in Lemma 5.32. Since no unexpected conditions 
arise in the proof of the two statements and the proof technique is practically 
a combination of the proofs for macro tree transducers and attributed tree 
transducers, we do not give the proof here. □ 
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7.4 Composition and Decomposition Results 

First we will prove a decomposition result for macro attributed tree trans- 
ducers which generalizes the decomposition of MAC into TOP and YIELD 
(Theorem 4.37). In fact, the two underlying lemmas are straightforward gen- 
eralizations of the corresponding lemmas for the decomposition of macro tree 
transducers (Lemmas 4.34 and 4.36). 

Lemma 7.27. MAT C ATT o YIELD, 

Proof. (See Lemma 4.34) Let N = {Att,E,A,ao,R,E) be a noncircular 
macro attributed tree transducer with E = (£^i,jE? 2 ) and E\ = (<i,...,tr) 
and r -h 1 = rank{qo). Denote the number max{n \ U ^ 0} by 

mx. 

We construct an attributed tree transducer A = (Att', i7, T*, Uq, iZ', £"), 
a function g : T^iZmx)^ and a tuple i G (T^)"*® such that tn = 

ta o yieldg^, 

• Define Alt' = {Ait')syn U {Att')inh where {Att‘)syn = {a' | a G Alisyn} and 
{AtP)inh = {P \f> E Attinh}- Every attribute in Att* has rank 1 . 

• Define T to be the ranked alphabet such that T^^^ = ^ ^ U 

{ai, . . . , amx} where the a’s are new symbols, and, for every k such that 

/ 0 or ^ 0 , define T^^^ = {cj^} and T^^^ = 0 otherwise. 

• Define the function g : ► T^iZrax) by g{6') = 6 (^ 1 , . . . , Zjk) if 5 G 

A^^\ and g{oci) = Zi for every i G {1, . . . , mx}. 

• Define t = E if mx = r, and f = (ti , . . . , tr,Zr+i , . • . , Zmx) if mx > r. 

• For the construction of the rules of A we need the set 

COMB = {COMBk^n\k^n > 0} of functions COMBk^n • 

RHS{Attsyn,Aitinh, A, kin) RHS{{Att')syn,{Att')inh, T, k) ' that 

are defined inductively on the structure of their arguments. 

(i) For every a G with I > 0, 1 < i < k and G 

RHS{Ait8yn,AttinhiA,k,n), we define C' 0 MBjb,n(a(^*, 6 > • j^/)) = 

C+i(a'(7ri),COM5,.n(6), • . .,C0M5,.n(6)). 

(ii) For every b G with / > 0 and G 

RHS{AttsymAttinh,A,k,n)y we define COMBk^n{b{Tr,^i,. . .y^i)) = 
cz+i(6'(7t), COMBkA^i ), . . . , COMBkA^i)). 

(iii) For every 6 G A^^^ with / > 0 and ^ 

RHS{Attsyn,Attinh,^yk,n)y we define COM5jb,„(5(^i, . . . ,^/)) = 

q+i( 5', C0M5,,n(6), . . . , COMBfc,n(^ 0 )- 

(iv) For every 1 < j < n, we define COMBk^niVj) = Qfj- 

Now we define iZ' to be the set every a G with Ar > 0 

we define: 



K = {a\ir) ^ COMBkAO 1 2/i, • • • , yn) - 0 ^ RA 

U{6'(7T2) -►C'OMBik,n(01(K^^yi>--*)yn) “^0 ^ ^<t}- 
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• Define E* : {Att^)inh Tp such that, for every 6 G and n > 0, we 

let E\V) = COMBo,n(£^ 2 ( 6 )). 

Due to the definition of the dependency graphs for macro attributed tree 
transducers (Def. 7.13) and for attributed tree transducers (Def. 5.12), it is 
clear that A is also noncircular. 

We extend the function g to 

Qe : U {b\e) I b G AtUnh} ^ 

such that for every 6 G with n > 0, ge{b*{e)) = 6 (e, 2 ^ 1 , ... , z„). 

The correctness of the construction can be shown by proving the following 
statements by simultaneous induction where we represent the substitution 
[- 2^1 ^ yi, . • • , ^ Vn] by [z ^ t/]„: 

K: For every n > 0, a G and s G Tp, the relation 

yieldg^{TA,a'{s)) G SF{N,Zn) holds and the equality TN,a{s) = 
yieldg/{TA,a>is))[z ^ y]n is true. 

L: For every Ar > 0, (t G ^ G RHS{a), and u G the rela- 

tion yieldg^(TA^a,coMBH,nio(^)) ^ SF{N,Zn) holds and the equality 
T’iv.a.eC^) = ^ y]n i® true. 

These statements can be proved by straightforward simultaneous induc- 
tion and hence, it is left to the reader. The correctness of the construction 
conforms with that in the proof of Lemma 4.34. □ 



Lemma 7.28. ATT o YIELD C MAT 

Proof. Let A = (Att, i7, T, oq, iJ, £*) be an attributed tree transducer, g : 
f(o) _► T^(Zr) such that for every a G T, g{a) is linear in Z, and t = 
(ti, . . . , tr) for some r > 0. We construct a macro attributed tree transducer 
N = {Ati\ Ey A, Uq, E*) such that ta o yieldg j = rjv. 

• Define Ait* = {Att*)syn U {Att*)inh and {Atf)syn = {a' [a G Attsyn) and 
(Att')inh = I b G Attinh}] moreover, every attribute has rank r -f 1, i.e., 
Atf = 

• Extend y to : F^^^ U Att({7r, ttI, . . . , tt/c}) — »• T^DAtv{Zr U 

{tt, 7t1, . . . , 7T/c}) where k = max{k \ ^ 0} by defining, for every c G Att 

and rj G {e} U {1, . . 

gjc{c{7rri)) = c\irrj,zi,.,.,Zr). 

(Clearly, for every a G F^^^ we have y'(a) = y(o).) 

• R^ is the set \J^^s{R^)a where (i?')<r is the set of rules such that, if, for 
some c G Att and 77 G {e} U {1, . . . , /c}, the rule c{irrj) ^ is in Ra, then 
the rule 
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yi, . . . , yr) ^ yieldg^iOl^i ^yi,.--,Zr yr] 

is in (ii')< 7 . 

• E' = (Ei,E2) where Ei = {tu...,U) and E2 : {Att')i„h -* T^(Y) 
is the function such that, for every b' G define E 2 Q)') = 

yieldg{E{h))[zi <- yi, . . . , Zr ^ yr]- 

The correctness of the construction is proved by showing the following two 
statements by simultaneous induction in which we represent the substitution 
[zi yii • • yr] by [z ^ y]r . As in the proof of the decomposition result 

MAT C ATT o YIELD (Lemma 7.27), we need an extension of g which can 
handle objects of the form b(e). Thus, we extend g to Qe : U {6(e) 1 6 G 

Attinh} T(^Att>)inhOA{{^} U Z) such that for every 6 G Atiinh- 

9e{b{s)) =6'(e,zi,...,zr). 

K: For every a G Attsyn and s G Ti;, the equality yield g^{TA,a{s))[z ^ y]r = 
TN,a*{s) holds. 

L: For every Ar > 0, (t G ^ G RHS{(t)j and cj G (Tj;)*, the equality 
yieldg^{TA,(T,du))[z ^ y]r = 'nv,<r,e'(^) bolds where = yieldg^{^)[z 4 - 

2/]f. 

□ 

From Lemmas 7.27 and 7.28, we can derive the following characterization 
of MAT. 

Theorem 7.29. MAT = ATT o YIELD. 

This characterization generalizes the characterization of MAC in terms 
of the composition of TOP and YIELD (see Theorem 4.37). Intuitively, it 
is clear that MAT = ATT oYIELD must be true, because on both sides of 
the equation the model which handles the customary replacement (i.e., macro 
tree transducers or top-down tree transducers) is generalized from a recursive 
descent device to the corresponding tree- walking device (i.e., attributed tree 
transducers and macro attributed tree transducers, respectively). 

From the characterization of MAT we can draw some conclusions. 

Corollary 7.30. MAT = ATT o ATT. 

Proof. 

MAT 

= ATT o Y I ELD (Theorem 7.29) 

C ATT o ATT (Corollary 6.24) 

C ATT o MAC (Lemma 6.1) 

= ATT o TOP o YIELD (Theorem 4.37) 

= ATT o YIELD (Theorem 5.47) 

□ 
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Corollary 7.31. MAT o TOP = MAT. 

Proof. 

MAT o TOP 

= ATToATToTOP (Corollary 7.30) □ 

= ATT o ATT (Theorem 5.47) 



Corollary 7.32. HOM o MAT = MAC o MAC 
Proof. 






HOM o MAT 
HOMoATToYIELD 


(Theorem 7.29) 


= 


MAC o YIELD 


(Theorem 6.25) 


c 


MAC o MAC 


(Lemma 4.32) 


= 


HOM 0 sl-MAC o MAC 


(Corollary 4.39) 


c 


HOMolv-ATToMAC 


(Note 6.9, Theorem 6.23) 


= 


HOM 0 Iv-ATT 0 TOP o YIELD 


(Theorem 4.37) 


c 


HOM 0 ATT o TOP o YIELD 


(It;- ATT C ATT) 


== 


HOM oATToYIELD 


(Theorem 5.47) 


c 


HOM o ATToATT 


(Corollary 6.24) 


= 


HOM o MAT 


(Corollary 7.30) 



7.5 Height Property 

In the chapters concerning top-down tree transducers, macro tree transduc- 
ers, and attributed tree transducers we have already shown an approximation 
of the height of an output tree i with respect to the height or size of the cor- 
responding input tree s: 

• for top-down tree transducers, there is a constant c such that height{t) < 
c • height{s) (Lemma 3.27), 

• for macro tree transducers, there is a constant c such that height{t) < 
^hetght(s) (Lemma 4.22), and 

• for attributed tree transducers, there is a constant c such that height{t) < 
c-size{s) (Lemma 5.40). 

In Corollary 7.30 it was shown that the class MAT is characterized by 
the two-fold composition of the class ATT, i.e., MAT = ATToATT. We can 
use this fact in order to derive an approximation of the height of the output 
trees for macro attributed tree transducers. 

Lemma 7.33. Let AT be a macro attributed tree transducer. There is a con- 
stant c > 0 such that, for every s £Tjj^ height{rN{s)) < 2^ **^®(*). 
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Proof. By Corollary 7.30, there are two attributed tree transducers A\ and 
A2 such that tm = TAi 0^42 • Let s eTj; and t ETa such that tn{s) = t. Then 
there is a such that rAi(s) = and rAa(t') = t. By Lemma 5.40, there are 
constants c and d such that height{f) < c-size{s) and height{t) < d-size(f). 
Thus, height(t) < d ■ size(t') < d ■ for some constant 6 > 0, 

according to Lemma 2.13. Then, rf. < d . ‘ < 2« for 

some constant e. □ 



7.6 Bibliographic Notes 

Most of the material presented in this chapter has been taken from [KV94b]. 
Example 7.3 is due to [Hou94]. 

In [Kiih97] the syntactic single use restriction of [GG84, Gie88] is consid- 
ered for macro attributed tree transducers. It transpires that the resulting 
class of tree transformations is a strict subclass of ATT. 




8. Two Examples 



In this chapter we will illustrate the usefulness of the formal models pre- 
sented by specifying two non-trivial functions in a syntax-directed way via 
tree transducers. Clearly, our formal models are rather archaic, e.g., they 
abstract from many-sortedness and from concrete computations in particu- 
lar semantic domains. Thus, in order to provide a more realistic specifica- 
tion language, we will enrich macro attributed tree transducers by these two 
features. This yields the formal model of so called many-sorted macro at- 
tributed tree transducer, and the two functions will be formalized by means 
of such transducers. The first function is the contextual analysis of a small, 
block-structured programming language. The second function is the insertion 
function in 2-3 trees. 



8,1 The Specification Language 

Here we superpose the many-sortedness and the possibility of concrete cal- 
culations in semantic domains on to the concept of macro attributed tree 
transducers; this leads to the concept of many-sorted macro attributed tree 
transducers. The formalization is straightforward, although slightly techni- 
cal. However, for the sake of completeness, we have decided to include this 
formalization in the book. First we need some notions concerning sorted sets 
and many-sorted algebras. 

Let 5 be a set of sorts (or types); as examples of sorts in this context 
we mention integer, boolean, real, list of ..., environment. An S-sorted set 
is a tuple (A, sort) where A is a set and sort : A — ► 5 is a function which 
associates a sort with every element in A. For every /c G 5, we represent by 
the set of elements of A which have sort k. An S-ranked alphabet is a 
finite S* X 5-sorted set. 

Let (I!,sort) be an 5-ranked alphabet and A be an 5-sorted set. The 
set of S -sorted trees over E indexed by A, denoted by Ts{A), is the smallest 
5-sorted set T such that the following conditions hold. 

(i) For every « G 5, the set U A'® C T'®. 

(ii) For every a G with /c, /ci, . . . , /cjb G 5 and trees t\ G , • • . , 

th G . the tree (T^t^ tk) 
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We note that, if 5 is a singleton, then an 5-ranked alphabet (i7, sort) is 
a usual ranked alphabet, and the set of 5-sorted trees over E is the set of 
usual trees over E. 

Let (Z*, sort) be an 5-ranked alphabet. A many-sorted E-algebra >1 is a 
pair (A, a) where A is an 5-sorted set such that, for every k E 5, the set 
A'® ^ 0, and a, called the interpretation function of A, is a function such 
that 

(i) for every a E a(cr) E A^ and 

(ii) for every a E a{a) is a function of type A^^ x . . . x A^* A^. 

We also have to associate sorts with the variables which occur in a many- 
sorted macro attributed tree transducer. For this purpose, we fix the sets IS 
and OS of input sorts and output soris^ respectively. Then, let Y = {Vi,K \ i > 

0, AC E 05} be a sorted set of variables such that sorf(2/»,/c) = ac. Moreover, let 
n = {wi I L E IS} be the 75-sorted set of path variables such that sort(iTt) = 

1. 

Definition 8.1. A many-soried macro attributed tree transducer is a tuple 
N = (Atty Z, A, A, ao, 72, E) where 

• Att is an (75 • OS*) x 05-sorted set, called the set of attributes'^ thus, 
every element c E Att has a sort of the form (^aci . . . ACr , acq) for some l E IS 
and ACo, ACi, . . . , ACr ^ 05; Att is partitioned into the disjoint sets Attsyn and 

of which the elements are called synthesized attributes axid inherited 
attributes y respectively, such that Att fl (Z U ^) = 0. 

• Z is an 75-ranked alphabet, called the input alphabety and A is an 05- 
ranked alphabet, called the output alphabet. 

• A = (A, a) is a many-sorted ^-algebra; thus, in particular, A is an 05- 
sorted set. 

• ao E for some iq E 75 and acq,i, . . . , ACo,r, /co E 05 is a 

designated attribute, called the initial attribute. 

• 72 is a set every o’ E Z, the set 72^ of rules satisfies 

the following conditions. 

(1) For every a E and a E E^ with X = {ti . . . ijb, t)y the set R^j 

contains exactly one rule of the form 

2/1, «1 ) • • • ) 2/n,«n) ^ 

where ^ E RH S{Attsyny AtUnhy Ay I, ac)'^ and ac = (aci . . . ac„, ac). 

(2) For every b E a E E‘’ with X = (ii . . .ikyi), and I <i < ky 

the set Rff contains exactly one rule of the form 

K^4*,2/i,/ci,---,2/n,ACn) 

where ^ E RH S{Attsyny Attinhy Ay T, ac)^ and ac = (aci . . . ac„, ac). 
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Let X = (ti . . . Ajb, t) E IS* X IS and k = {ki . , ./Cn, k) E OS* x OS. The 
set RHS{Attsyn,Attinhi^i^iK) of right-hand sides over Alt sym AtUnh, 
A, if and k is the smallest 05-sorted set RHS satisfying the following 
conditions. 

(i) For every a E with 1 < i < Ar, / > 0, and E RHS^\ ..., 

E RHS ^^ , the term a(;K,,i, , . . . , is in RHS ^ . 

(ii) For every 6 G and ^ € RHS<>\ G RHS’>\ the term 

is in 

(iii) For every 5 G and G RHS^^, G RHS^', the term 

(iv) For every 6 G OS, C 

• E = {Ei,E 2 ) is the environment where Ei = and U E T^°’* 

(recall that the /co,»’s are the argument sorts of the initial attribute) and 
E 2 : Attinh T^(y) is a function such that for every b E Att\*^j^ '^^'^\ 
E2{b) E . • •,2/ib,/cfc})'". □ 

The notions and notations which were fixed for macro attributed tree 
transducers after Note 7.4, translate to the concept of many-sorted macro 
attributed tree transducers. In the following, let N = {Att, i7. A, ao, i?, E) 
be an arbitrary, but fixed, many-sorted macro attributed tree transducer. 

To define the derivation relation consistently, we also have to associate a 
sort with every occurrence w E occ{s) of a tree s. Intuitively, the sort of w is 
the result sort of the label of w. Formally, if s is a tree over E, w E occ(s), 
and label{s,w) = (7 E then sort{w) = l. 

The derivation relation of a many-sorted macro attributed tree transducer 
is defined in the same way as for a macro attributed tree transducer except 
that an additional derivation step may consist of a calculation in the algebra 
A. 

Definition 8.2. Let s E for some t E IS. The derivation relation induced 
by N on s is the binary relation over the set where ^ = Att U ^ U 
A U occ{s) U y , such that for every y?, E we define <p if there is 

a context (3 E and one of the following three conditions holds. 

(1) • There is a synthesized attribute a E Att^syn '*^^'^\ 

• there is an occurrence w E occ(s)^ with label{s^w) = cr E 
and 

• there are trees E (Pn ^ 

such that (p = l3[a(w^ ^i, . . . , (pn)] 

and i> = /?[r/is(a(7r,), &)[k, <- ^„]]. 

(2) • There is an inherited attribute 6 G 

• there is an occurrence w E occ(s)^* with w = vi for some v E occ{s)\ 

label(s,v) = <r G - 1 < * < and 

. there are trees (pi G ...,<Pn £ T^" 
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such that (p = /?[6(u;, , Pn)] 

and V = 0[rhs(b{irj), <- w][yi,«, <- y>i, . . . , y„,«„ <- y„]]. 

(3) • There is a 6 G /j(«i and 

• there are elements v\ E G 

such that y? = /3[6{vi , . . . , Vn)] 

and V’ = y3[a(5)(vi,...,Dn)]. ° 

It is clear from Condition (3) of Def. 8.2 that the interpretation of an 
element 6 is strict in the sense that all its arguments have to be evaluated 
first. 

Since we will not be proving theoretical results about many-sorted macro 
attributed tree transducers here, but rather using them to show their practi- 
cal relevance, we will also not be reconsidering the topics about circularity, 
confluency, termination, and inductive characterization. These notions and 
the corresponding results can be generalized in a straightforward manner 
from Sects. 7. 2-7.4. 



8.2 Contextual Analysis 

It is well known that contextual constraints of an imperative, block-structured 
programming language, like e.g. the scope rule and the type rule, cannot be 
expressed by a context-free grammar. Such constraints are said to be part 
of the static semantics of a language. Recall that the scope rule indicates 
the scope of a declared variable, and the type rule defines the way in which 
typed expressions may be combined. The contextual analysis checks whether 
the contextual constraints are obeyed by a given program. 

In this section we will formalize the contextual analysis by means of a 
many-sorted macro attributed tree transducer. The programming language 
concerned is a very simple imperative, block-structured programming lan- 
guage in the spirit of ALGOL 60; the language is called CHECK. It only 
allows the declaration of variables of type either boolean or integer, and the 
assignment of variables (see Fig. 8.1 for a CHECK-program). For the sake of 
simplicity, we will restrict CHECK-programs to have at most two variables 
a and 6. 

The scope and type rules of CHECK are very easy: the scope of a variable 
is the entire block in which it is declared, and the types of the variables 
involved in an assignment must be equal. Thus, in the program in Fig. 8.1, 
the scope of the integer variable b is statement stat 2 , and the scope of the 
boolean variable b is stall, i.e., the begin-end block, and stats (in fact, only 
stats, because b is redeclared inside stati). The statement stats does not 
yield a type conflict, i.e, it matches with the type rule, whereas stats yields a 
type conflict, because 6 has type boolean and a has type integer. Let us now 
define the context-free part of CHECK. We have indexed the abbreviations 
of productions by the corresponding types. 
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program 




var 6: bool; var a: int; 


{deck} 


begin 


{stati} 


var 6: int; 


{dech} 


b := a 


{stat2} 


end; 




b := a 


{stats} 


end 





Fig. 8.1. A CHECK-program P 



Definition 8.3. The context-free pari of CHECK-programs is generated by 
a context-free grammar which is specified by the following set He heck of 



productions: 






P — » program D; S end. 


MT.D), 

^2 


D —*■ var I : T 


JDD.D). 

^3 


D-D 




I := I 


^5 


S —*■ begin D; S end 


„(55,S). 

^6 


S-*S;S 


J^d). 


I^a 




I^b 


^9 


T —* int 


MO • 


T -V bool 



We view the set Hcheck of productions as an 75cif EC/f -ranked alpha- 
bet where I Sc heck = Intuitively, P, D, 5, 7, and T stand 

for program, declaration, statement, identifier, and type. □ 

In order to describe environments, i.e., the bindings of variables and types, 
and their maintenance formally, we construct a many-sorted algebra which 
determines several appropriate semantic domains. 

Definition 8.4. The semantic domain for the contextual analysis of 
CHECK-programs is the many-sorted i^cff^jCK-algebra Acheck = 
defined as follows. 



• OScHECK is the set {i,t,e,6} of sorts. 

• A is the O Sc HECK -sorted set A* U A* U A® U A*', where 
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- i4* = {a, 6} is the semantic domain of identifiers^ 

— A* = {ini, bool, undef} is the semantic domain of types, 

— = {p\p : A* A^} is the semantic domain of environments, and 

- A^ = {true, false} is the semantic domain of boolean values. 

• ^CHECK is the 05-ranked alphabet 

ty-eq^^^^^^} U Acheck.o 

with Acheck,o = nnde/^^’^)}. 

• For every i, i\ £ A* ,ti,t 2 E A* , p G A^ , and bi,b 2 E A^, define a as follows: 

a{look-up){p, i) = p(i) 

a{update){p, ii, t) = Ai. if i\ = i then t else p{i) 
a{po){) = Xi.undef 

a(and)(6i, 62) = true iff (61 is true and 62 is true) 

d{ty~eq){ti,t2) = true iff (fi = ^2) and t\,t2 E {int, bool} 

and, for every symbol 6 G AcHECK,0i define a( 5 )() = 6 . □ 

Now we construct a many-sorted macro attributed tree transducer N 
which performs the contextual analysis of CHECK-programs. The trans- 
ducer N uses the synthesized attribute Envir-syn and the inherited attribute 
Envir-inh to traverse depth-first left-to-right through a declaration and to 
accumulate the bindings between identifiers and types in a parameter posi- 
tion. After having accumulated the current environment, N distributes this 
environment to every statement of the program; the distribution is done in a 
strict recursive descent, i.e., there is no tree-walking involved in the statement 
part of the program. 

Definition 8.5. Define the many-sorted macro attributed tree transducer 
N = {Att, EcHECKyAcHECK^AcHECKyCheck-prog, R, E) as follows: 

• Att = Attsyn U Attinh where 

Attsyn = [Check-prog^^'^\ Check^^^*^\ 
and Attinh = {Envir-inh^^^'^^} 

• Echeck contains the following elements (as in Def. 8.3) which are grouped 
according to their result sort: 



^2 



Jdd,d) 
^3 



(II, S) (DS,S) (SS,S) 






'6 



>.T) %,T) 



^9 






• Acheck contains the following elements (as in Def. 8.4): 

update^^**^^\ p^'^\ 
look-up^^*'*\ 




6 (^. 0 , 



8.2 Contextual Analysis 247 



• Acheck is the semantic domain for the contextual analysis of CHECK- 
programs (Def. 8.4). 

• R contains the sets i?n, Rrio- 



Rri • 


Check-prog{TTp) 




Check{TTp2, Envir-syn{‘Kp\, po())) 




Envir-inh{irpl,yi^e) 




yi,e 


Rtq • 


Envir-syn{Trj),yi,e) 




Envir-inh{‘KD,update{yi g, Id{itDl), 
Ty(irD2))) 


Hr 3 • 


Envir-8yn{wD,yi,e) 




Envir-syn{iTD2, yi,*) 




Envir-inh(TTD2, j/i,e) 


—<■ 


Envir-8yn{wD 1, yi,e) 




Envir-inh^nol, yi,e) 




Envir-inh(TTD , yi_e) 


Hr 4 • 


Check{ns,yi,e) 




ty-eq(look-up{yi^e,Id(wsl)), 

look-up(yi^e,Id{iTs^))) 


Hr 3 ' 


Check{Ks,yi,e) 




Check{irs2, Envir-8yn{irsl, J/i,e)) 




Envir-inh{rsl,y\,e) 




yi,e 


Hre • 


Check{Trs,yi,i) 




and{Check{irsl, yi,e), 
CftecA:(7rs2,j/i,e)) 


Hrr' 


Id{irj) 




a() 


Hr 3' 


Id{iTi) 




K) 


Hr 3 • 


Ty{iTT) 




int{) 


Hrio* 


Tyiirr) 




boolQ 



• E = (£’i,£' 2 ) with £*1 = (), because the initial attribute Check-prog does 
not have context parameters, and £2 is an arbitrary function of appropriate 
type; in fact, it does not matter how £2 is defined, because Check-Prog 
does not depend on the values of the inherited attributes at the root of the 
input tree. 

Now let us consider again the CHECK-program P of Fig. 8.1. The abstract 
syntax tree tp of P (Fig. 8.2) is the tree: 

tp = r[^^'^\decli,r^^^'^\stati^stat2)) 

where 



stati = r^^^'^\decl 2 ,stat 2 ) 






' 10 



^9 



)) 
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dcd, = 
act, = 
stats = stats • 




Finally, we show the derivation of N with the input tree tp, where we 
represent =>N,tp by =>. 

Check-prog{e) 

^ Check{2, Envir-syn{l^ Po{))) 

^ Check{2, Envir-syn{12y po{))) 

^ Check{2, Envir-inh{l2, update{po{), Id{121), Ty(122)))) 

Check{2, Envir-inh{l2, update{po{), a(), mt()))) 

Check{2y Envir-inh{12, update{\i,undef ^ a, int))) 

=> Check{2, Envir-inh{l2, pi)) 

where pi = Ai. if i = a then int else undef 
=>« Check{2, Envir-syn{ll, pi)) 

Check{2, Envir-inh{ll, update{pi, 7d(lll), Ty(112)))) 




8.3 Insertion into 2>3 Trees 249 



Check{2^ Envir-inh{\l^ P 2 )) 

where p 2 = Ai. case i of a : ini\ b : bool] otherwise : undef 
^ C'AecAr(2, Envir-inh{\^ P 2 )) 

Check{2, P 2 ) 

^ and{Check{2\^ ^), Check{22, P 2 )) 

and(C'feecAr(212, Envir~syn{211, ^)), 

ty-eg(/ooAr-up(^, 7d(221)), look-up{p 2 , Id{222)))) 
and((7/iecfc(212, ^), ty-eq{look-up{p 2 ib), Iook-up(p 2 ,a))) 
where p^ = Ai. case t of 2 : int; 6 : int; otherwise : undef 
anrf(ty-eg(/ooAr-up(^, 6), look-up{ps,a))yty-eq{bool, int)) 
and(ty-eq{int j int) j false) 
false. 

Hence, the many-sorted macro attributed tree transducer N computes the 
answer false for the contextual analysis of the CHECK-program P. 



8.3 Insertion into 2-3 Trees 

This second example concerns the insertion of information into 2-3 trees. 
Such data structures are well suited for storing and retrieving information in 
a very efficient way. For our consideration, it is convenient and quite sufficient 
to identify a piece of information by its key; a key is assumed to be a natural 
number. 

A 2-3 tree is a balanced search tree in which the nodes may have rank two, 
three, or zero. A node with rank two carries one key, a node with rank three 
carries two keys, and a leaf can carry either one or two keys. We assume that 
every key occurs at no more than one node (for an example see Fig. 8.3(a)). 

A search tree implies the following. If n is a node of rank three and n 
carries the keys zi and Z 2 with zi < Z 2 , then 

• every key which occurs in the first subtree of n is less than z \ , 

• every key which occurs in the second subtree of n is greater than zi and 

less than Z 2 , and 

• every key which occurs in the third subtree of n is greater than Z 2 , 
and if n is a node of rank two and n carries the key z, then 

• every key which occurs in the first subtree of n is less than z, and 

• every key which occurs in the second subtree of n is greater than z. 
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Moreover, in a 2-3 tree all maximal paths have the same length; in other 
words, the leaves occur at the same depth. 

Here we represent 2-3 trees as trees over an appropriate ranked alphabet 
(see Fig. 8.3 (b), where the dotted circles indicate the corresponding nodes in 
(a)). Since the keys are natural numbers and since we do not want to restrict 
the set of possible keys, we will construct a sorted ranked alphabet which is 
infinite. Clearly, this extends the concept of many-sorted macro attributed 
tree transducers again. However, this extension is justified, because it is not 
exploited to code extra transformational power into the concept. 




Fig. 8.3. (a) A 2-3-tree and (b) its representation 
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Definition 8 . 6 . 1. The set of input sorts for 2-3 trees is the set 752,3 = 

{t, n}; the input sorts t and n stand for tree and natural number^ respec- 
tively. 

2. The ranked alphabet for 2-3 trees is the 752 , 3 -ranked alphabet 772,3 = 
natural number}. 

3. The set of 2-3 trees, denoted by T 2 , 3 , is the set {t E 7^72 3 1 there is a 

such that every path u; of f with label{t,w) G {end-two, end-three} has 
length k}. □ 

Definition 8.7. The semantic domain for 2-3 trees is the many-sorted ^ 2 , 3 - 
algebra w42,3 = defined as follows: 

• The set 052,3 of output sorts is the set {t,n,b}; the sorts t and n have the 
same meaning as in the set 752,3 of input sorts and 6 stands for boolean. 

• The output alphabet Z\2,3 is the 052 , 3 -ranked alphabet 7 ^ 2,3 U {if-then- 

else^^^*’*\ We will write expressions, which include if-then-else 

or <, in the usual infix-notation. 

• Let A = A* U U where 

— A* = T 2 , 3 , i.e., the set of 2-3 trees 

— A” = N, i.e., the set of natural numbers 

— A^ = {true, false], i.e., the set of boolean values. 

• For every fi,f 2>^3 £ A^, zi,Z 2 E A^ , define a as follows: 

a{two){ti,zi,t2) = two{ti,z\,t2) 

oc{three){ti ,z\,t 2 ,Z 2 , U) — three{t \ , Zi , < 2 , - 2 T 2 , ^ 3 ) 

a{end-tw6){zi) = end-two{z\) 

a{end-three){zi, Z 2 ) = end-three{zi, Z 2 ) 

a{if-then-else){true,ti,t 2 ) = t\ 

a(if-then-else)(/a/se,fi,f 2 ) = ^2 

Qf(<)(^ij - 2 ^ 2 ) = true iff z\ is less than Z 2 . 

□ 

Condition (3) of Def. 8.2 stated that every symbol 6 G ^ is interpreted in 
a strict way. In particular, for if-then-else this is not desirable because the 
values of both cases (then and else) have to be evaluated before a decision 
in favor of one of them is performed. Hence, we add a fourth condition to 
the definition of the rewrite relation of many-sorted macro attributed tree 
transducers, which handles the non-strict interpretation of if-then-else. 

Definition 8 . 8 . (Modification of Def. 8.2) Let s £ T^. The derivation re- 
lation induced by N on s is the binary relation =>n,s over the set where 
^ = AttUAUA\Jocc{s)UY, such that for every <p,rp we define (p =>n,s 

if there is a context /? Clin ^ and nnf» nf thp fnllnwino* fmir mnditirknc Virkldc 
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(1) As in Def. 8.2 

(2) As in Def. 8.2 

(3) As in Def. 8.2 except that 6 if-then-else 

(4) There is an if-then-else G , there are v E and (ptme G T^, 

<P false G where ^ = Aii \J AU A U occ{s) U Y such that (p = )3[if v 

then (pirue else (pfaUe]y and ° 

Now let us explain briefly the way in which a new natural number z is 
inserted into a 2-3 tree s. We assume that z does not occur in s. This insertion 
proceeds in two phases. 

First, the attribute insert is called at the root of s with the parameter 
z, i.e., the rewriting starts with the sentential form insert{SyZ). By moving 
from the root of s towards a node which is labeled either by end-two or 
end-three, the attribute insert searches for the so-called insertion point of 
z in s. Informally, the insertion point of z in s is the uniquely determined 
node n of s such that every key which occurs left of n is less than z, and 
every key which occurs right of n is greater than z. We formulate this search 
procedure in the form of sets R\hriV"”'> and 

of rules of a many-sorted macro attributed tree transducer where the auxiliary 
synthesized attribute id computes the identity of its argument (for the sake 
of brevity we have omitted the sorts). In Fig. 8.4 the result of this search 
process is illustrated. 




^insertion, 

two 



insert{TT,yi) 



if yi < id{ir2) 
then insert ( ttI, yi) 
else insert(7r3, yi) 
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jDinsertion, 

^three 

insert{ir^yi) 



reinsertion, 
^end—two ' 



msert(7T, yi) 



if yi < tcf(7r2) 
then insert{'K\^ yi) 
else if yi < id(7r4) 

then msert(7r3,yi) 
else ins6rt(7r5,yi) 



if yi < id(7rl) 

then st£6st(7r, end-three{yi , icf(7rl))) 
else 8ubsi{'K^ end-tAree(id(7rl), yi)) 



reinsertion . 
^end^three' 



msert(7T,yi) 



if yi < id(7rl) 

then re(7T, end-tit;o(yi), id(7rl), end-tit;o(id(7r2))) 
else 

if yi < id(7r2) 

then re(7T, end-tt/;o(id(7rl)), yi , end-tit;o(id(7r2))) 
else re(7T, end-iwo{id{Tr\))^ id{ir2), end-two{yi)) 



As the rules in and H^hreV*^^ show, the attribute insert moves 

down from the root by selecting an appropriate subtree each time it visits a 
node, until it reaches a node labeled either by end-two or end-three. Now the 
second phase starts. If insert has reached a node which is labeled by end-two^ 
then, intuitively, the number z can be added to this node by transforming the 
end-two-node into an end-three-node and filling in z at the appropriate free 
place. Then, the new 2-3 tree has to be generated; this is done by the inherited 
attribute subst which stands for substitution. In Fig. 8.5 the computation of 
subst is illustrated where s[u; x] denotes the tree which is obtained from 
s by replacing the subtree at occurrence w by some variable x. Now we show 
the rules for subst. 



jesubst, 

^wo • 

subst(irl, yi) 
su6st(7r3,yi) 



resubst. 

^three' 



subst{TTl,yi) 
subst{w3y yi) 
su6st(7r5,yi) 



subst{7T^ two{yi , id(7r2), id(7r3))) 
subst{7T^ two{id{wl), fd(7r2), yi)) 



subst{ir, three{yi , id{ir2), id{iri), id(7r4), 2d(7r5))) 
subst{Wy three{id{7rl)y id(7r2), yi , id(7r4), 2d(7r5))) 
subst{'ir^ three(id(7rl), id(7r2), id(7r3), id(7r4), yi)) 



If insert has reached a node which is labeled by end-three^ then there is 
no place in which to insert z at this node. Instead, the 2-3 tree above the 
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insertion point for z has to be rearranged. This rearrangement is performed 
by the inherited attribute re. The rules for re are: 

RZo- 

re{irl,yi,y2,y3) -+ subst{Tr,three{yi,y2,y3,id{n2),id{iri))) 

re{ir3,yi,y2,y3) subst(ir,three{id{Tl),id{Tr2),yi,y2,y3)) 



prc 

•^three’ 

re(jrl,yi,j/2,y3) 

re(7T3,yi,y2,!/3) 

re{irb,yi,y2,y3) 



re(jT, two{yi , y2, ys), id{w2), 
two{id{T3), id{iT4), id(ir5))) 
re(7T, two{id{Trl), id{-ir2), yi), j/2, 
two{y3, id{w4), id(jr5))) 
re{ir,two{id{7rl), id{w2), id{wS)), td(5r4), 
two{yi,y2,y3)) 



Finally, we show the rules for the synthesized attribute id which computes 
the identity of its argument. 



id{n) 



Kiree- 

id(7r) — »• 



^eiid-two' 
id{7r) —)• 

^end— three' 
id(7r) —4^ 



two{id{7rl), id(7r2), id(7r3)) 
three{id{wl), id{ir2), id{ir3), id(7r4), id{w5)) 
end-two{id{wl)) 
end-three{id{Trl), id{7r2)) 
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Note that there is an infinite number of rules of the form id(7r) — ► z. Such 
rules are used to transform the syntactic object z into a semantic object 
G N. 

Now the following question arises: how does the number z which has to 
be inserted come into the macro attributed tree transducer? For the sake of 
convenience, we solve this problem by adding an extra root-marker on 

top of the original input tree and, for every number z, we specify a particular 
macro attributed tree transducer. For two different numbers z\ and z^ the 
two corresponding transducers only differ in one rule in Rroox • Here again we 
should point out to the enquiring reader that we wanted only to show the 
usefulness of macro attributed tree transducers as a compuiaiion model; we 
did not claim that this is a perfect specification language with corresponding 
input/output features. 



Definition 8.9. Let z be a natural number. Construct the many-sorted 
macro attributed tree transducer N = (Atf, i7, Z\,.42,3, in, /?, — ) for the in- 
sertion of z as follows. 

• Att = Attsyn U Atiinh where 

Attsyn = and AtUnh = 

• i7 = U S 2,3 and A = ^ 2,3 where the sorted ranked alphabets 

i? 2,3 and A 2,3 are defined as in Defs. 8.6 and 8.7, respectively. 

• «^ 2,3 is the semantic domain as defined in Definition 8.7. 

• iZ = Rrooi C Rtwo C Rthree C Rend^two C Rend— three C Rz 
and the subsets are defined by 



Rrooi’ 

in(7r) mserf (tt, z) 

re(7Tl,yi,y2,y3) fn;o(yi, y2, ys) 

jy Tyinsertion 1 1 iy$ub 3 t 1 1 pre 1 1 jyid 

^two — ^wo ^ ^wo ^ ^wo ^ ^wo 

jy , jyinsertion 1 1 jysubst 1 1 pre 1 1 ptd 

^three ~ ^three ^ ^three ^ ^three ^ ^three 

Rend— two ~ ^*end-two ^ ^end-two 

jy , , jyinsertion 1 1 ptd 

^end-three — ^end-three ^ ^end-three 

Rz==IP/ □ 

Finally, we show a computation induced by N, Consider the tree 
t = roof(f/iree(end-ft/;o(l),2,end-fii;o(3),4, end-t/iree(5,6))) and the num- 
ber z = 7 to be inserted. Then N computes as follows where we abbreviate 
=>N,t to =>: 
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in{e) 

^ msert(l,7) 
insert(15,7) 

re(15, end-two{id{lbl))^ id(152), end-iwo{J)) 

re(15, end-two{b)^ 6, end-iwo{J)) 

=» re(l,^u;o(id(ll), td(12), id(13)), td(14),^ti;o(en(i-<if;o(5), 6, 
end-twoij))) 

re(l, ^iyo(end-tt(;o(l), 2, end-tii;o(3)), 4, tti;o(encf-^it;o(5), 6, 
encf-^it;o(7))) 

two{two{end-iwo{\)^ 2, enci-^w;o(3)), 4, tti;o(encf-fi/;o(5), 6, 
end-^ti;o(7))) 



8.4 Bibliographic Notes 

The approach of many-sorted tree transducers has been formalized in [Vog91] 
and the embedding of computations in particular domains has been studied 
in [FHVV93]. This enrichment leads to the concept of the many-soried macro 
attributed tree transducer. 

The example of contextual analysis is taken from [Vog91] where this analy- 
sis has been described in terms of top-down tree transducers, macro tree 
transducers, high-level tree transducers, and modular tree transducers. The 
example of insertion into 2-3 trees is due to [Kiih95a]. 

In [Ind85, Ind88] the code generation for an easy programming language 
has been expressed as a system of recursively defined functions. In fact, this 
system can be viewed as a many-sorted macro tree transducer. 

In [BEMV97] a language-based environment is described. The language 
is a functional programming language called NoName that supports the tree 
transducers investigated in this book, in particular, the many-sorted macro 
attributed tree transducers. The software environment contains a syntax- 
directed editor for iNToiSTame-programs [Ern95], a link to an Ingres database 
system which contains standard tree transformations [Miil95], and a transfor- 
mation system which transforms the primitive recursion on trees into iteration 
[MV95a, MV95b]. 
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